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1 Introduction 



Equilibrium statistical mechanics intends to describe and explain the macroscopic be- 
havior of systems in thermal equilibrium in terms of the microscopic interaction between 
their great many constituents. As a typical example, let us take some ferromagnetic ma- 
terial like iron; the constituents are then the spins of elementary magnets at the sites of 
some crystal lattice. Or we may think of a lattice approximation to a real gas, in which 
case the constituents are the particle numbers in the elementary cells of any partition of 
space. The central object is the Hamiltonian describing the interaction between these 
constituents. This interaction determines the relative energies between configurations 
that differ only microscopically. The equilibrium states with respect to the given interac- 
tion are described by the associated Gibbs measures. These are probability measures on 
the space of configurations which have prescribed conditional probabilities with respect 
to fixed configurations outside of finite regions. These conditional probabilities are given 
by the Boltzmann factor, the exponential of the inverse temperature times the relative 
energy. This allows one to compute, at least in principle, equilibrium expectations and 
spatial correlation functions following the standard Gibbs formalism. Most important 
are the so called extremal Gibbs measures since they describe the possible macrostates 
of our physical system. In such a state, macroscopic observables do not fluctuate while 
the correlation between local observations made far apart from each other decays to 
zero. 

Since the early days of statistical mechanics, geometric notions have played a role 
in elucidating certain aspects of the theory. This has taken many different forms. Ar- 
guably, the thermodynamic formalism, as first developed by Gibbs, already admits 
some geometric interpretations primarily related to convexity. For example, entropy is 
a concave function of the specific energy, the pressure is convex as a function of the 
interaction potential, the Legendre-Fenchel transformation relates various fundamental 
thermodynamic quantities to each other, and the set of Gibbs measures for an interac- 
tion is a simplex with vertices corresponding to the physically realized macrostates, the 
equilibrium phases. 

Here, however, we will not be concerned with this kind of convex geometry which 
is described in detail e.g. in the books by Israel [138] and Georgii |9(|. Rather, the 



geometry considered here is a way of visualizing the structure in the typical realizations 
of the system's constituents. To be more specific let us consider for a moment the case 
of the standard ferromagnetic Ising model on the square lattice. At each site we have 
a spin variable taking only two possible values, +1 and —1. The interaction is nearest- 
neighbor and tends to align neighboring spins in the same direction. By the ingenious 
arguments first formulated in 1936 by Peierls |187| (see also (6^, |213| , |96|1), the phase 
transition in this model can be understood from looking at the typical configurations 
of contours, i.e., the broken lines separating the domains with plus resp. minus spins. 
The plus phase (the positively magnetized phase) is realized by an infinite ocean of 
plus spins with finite islands of minus spins (which in turn may contain lakes of plus 
spins, and so on). On the other hand, above the Curie temperature (first computed by 
Onsager) there is no infinite path joining nearest neighbors with the same spin value. 
So, for this model the geometric picture is rather complete (as we will show later). In 
general, however, much less is known, and much less is true. Still, certain aspects of 
this geometric analysis have wide applications, at least in certain regimes of the phase 
diagram. These 'certain regimes' are, on the one hand, the high-temperature (or, in a 
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lattice gas setting, low-density) regime and, to the other extreme, the low temperature 
behavior. 

At high temperatures, all thermodynamic considerations are based on the fact that 
entropy dominates over energy. That is, the interaction between the constituents is not 
effective enough to enforce a macroscopic ordering of the system. As a result, every 
constituent is more or less free to behave at random, not much influenced by other 
constituents which are far apart. So, the system's behavior is almost like that of a free 
system with independent components. This means, in particular, that in the center of 
a large box we will typically encounter more or less the same configurations no matter 
what boundary conditions outside this box are imposed. That is, if we compare two 
independent realizations of the system in the box with different boundary conditions 
outside then, still at high temperatures, the difference between the boundary conditions 
cannot be felt by the spins in the center of the box; specifically, there should not exist 
any path from the boundary to the central part of the box along which the spins of 
the two realizations disagree. This picture is rather robust and can for example also be 
applied when the interaction is random; see Sections and ^. 

At low temperatures, or large densities (when the interaction is sufficiently strong), 
the picture above no longer holds. Rather, the specific characteristics of the interaction 
will come into play and determine the specific features of the low temperature phase. In 
many cases, the low temperature behavior can be described as a random perturbation of 
a ground state, i.e., of a fixed configuration of minimal energy. Then we can expect that 
at low temperatures, and sometimes even up to the critical temperature, the equilibrium 
phases are realized as a deterministic ground state configuration, perturbed by finite 
random islands on which the configuration disagrees with the ground state. This means 
that the ground state pattern can percolate through the space to infinity. One prominent 
way of confirming this picture is provided by the so called Pirogov-Sinai theory which 
is described in detail e.g. in 230 1. In Section || we will discuss some other techniques of 
establishing the same geometric picture. 

It is evident from the above that percolation theory will play an important role in 
this text. In fact, we will mainly be concerned with dependent percolation, but one 
can say that independent percolation stands as a prototype for the study of statistical 
equilibrium properties in geometric terms. In independent percolation, the model is 
extremely simple: the components are binary-valued and independent from each other. 
What is hard is the type of question one asks, namely the question of existence of 
infinite paths of l's and their geometry. We will introduce percolation below but refer 
to other publications (such as the book by Grimmett [108]) for a systematic account of 
the theory. 

Percolation will come into play here on various levels. Its concepts like clusters, 
open paths, connectedness etc. will be useful for describing certain geometric features of 
equilibrium phases, allowing characterizations of phases in percolation terms. Examples 
will be presented where the (thermal) phase transition goes hand in hand with a phase 
transition in an associated percolation process. Next, percolation techniques can be 
used to obtain specific information about the phase diagram of the system. For example, 
equilibrium correlation functions are sometimes dominated by connectivity functions in 
an associated percolation problem which is easier to investigate. Finally, representations 
in terms of percolation models will yield explicit relations between certain observables in 
equilibrium models and some corresponding percolation quantities. In fact, the resulting 
percolation models, like the random-cluster model, have some interest in their own right 
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and will also be studied in some detail. 

This text is supposed to be self-contained. Therefore we need to introduce various 
concepts and techniques which are well-known to some readers. On the other hand, 
important related issues will not be discussed when they are not explicitly needed. For 
more complete discussions on the introductory material we will refer the interested 
reader to other sources. More seriously, we will not include here a discussion of some 
important geometric concepts developed in the 1980's for the study of critical behavior 
in statistical mechanical systems, namely random walk expansions or random current 
representations. Fortunately, we can refer to an excellent book [77] where the interested 
reader will find all the relevant results and references. Important steps in this context 
include || ||, [?], |TT], 4C , 41] and the references therein. 

Finally, to avoid misunderstanding, the random geometry in the title of this work 
should not be confused with stochastic geometry (or geometric probability) which, as 
a branch of integral geometry, provides a very interesting tool-box for the discussion 
of morphological characteristics of random fields appearing in statistical physics and 
beyond, see p5|, [l72|, p| . 

Acknowledgements: It is a pleasure to thank J. Lebowitz for suggesting (and in- 
sisting) that we should write this report. We are also grateful to L. Chayes, A. CD. 
van Enter and J. Lorinczi who looked at parts of the manuscript and made numerous 
suggestions, and to Y. Higuchi for discussions on Proposition 8.5. 
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2 Equilibrium phases 



2.1 The lattice 

Our object of study are physical systems with many constituents, spins or particles, 
which will be located at the sites of a crystal lattice C The standard case is when 
C = Z d , the d-dimensional hypercubic lattice. In general, we shall assume that C is 
the vertex set of a countable graph. That is, C is an at most countable set and comes 
equipped with a (symmetric) adjacency relation. Namely, we write x ~ y if the vertices 
x, y £ C are adjacent, and this is visualized by drawing an edge between x and y. In 
this and y are also called neighbors, and the edge (or bond) between x and 

y is denoted by (xy). We write B for the set of all edges (bonds) in C. A complete 
description of the graph is thus given by the pairs (C, ~) or (C,B). 

In the case C = Z d , the edges will usually be drawn between lattice sites of distance 
one; hence x ~ y whenever |x — y\ = 1. Here, | • | stands for the sum-norm, i.e. 
\x\ = YLi=i \ x i\ whenever x = (xi, . . . ,Xd) € Z d . This choice is natural because then 
\x — y\ coincides with the graph-theoretical (or lattice) distance, viz. the length of the 
shortest path (consisting of consecutive edges) connecting x and y. For convenience, 
we sometimes use the same notation in the case of a general graph. On Z d we will 
occasionally distinguish between the standard metrics d\(x,y) = J2i \ x i ~DiV d2(x,y) = 
[J2i=i( x i ~ Vi) 2 )] 1 ^ 2 an d d OQ (x,y) = maxj \xi — yi\. Given any metric d on C, we write 
d(A, A) = inf x 

eA,yeA d(x, y) for the distance of two subsets A, A C C 
We will always assume that the graph (£, ~) is locally finite, which means that each 
x 6 C has only a finite number N x of nearest neighbors. In other words, N x is the 
number of edges emanating from x. N x is also called the degree of the graph at x. In 
many cases we will even assume that (£, ~) is of bounded degree, which means that 
N = sup^g^ N x < co. Common examples of such graphs, besides Z d , are the triangular 
lattice in two dimensions, and the regular tree (also known as the Cayley tree or the 
Bethe lattice), which is defined as the (unique) infinite connected graph containing no 
circuits in which every vertex has exactly d + 1 nearest neighbors. 

A region of the lattice, that is a subset A C C, is called finite if its cardinality |A| is 
finite. We write £ for the collection of all finite regions. The complement of a region A 
will be denoted by A c = C \ A. The boundary <9A of A is the set of all sites (vertices) 
in A c which are adjacent to some site of A. 

At some occasions we will need the notion of thermodynamic (or infinite volume) 
limit, and we need to describe in what sense a region A G £ grows to the full lattice C. 
For our purposes, it will in general be sufficient to take an arbitrary increasing sequence 
(A„) with Un>i An = £• In the case C = Z d , we will often make the standard choice 
A n = [—n, n] d n Z d , the lattice cubes centered around the origin. As £ is a directed set 
ordered by inclusion, we will occasionally also consider the limit along £. In each of 
these cases we will use the notation A f C 

2.2 Configurations 

The constituents of our systems are the spins or particles at the lattice sites. So, at each 
site x 6 C we have a variable a(x) taking values in a non-empty set S, the state space or 
single-spin space. In a magnetic set-up (to which we mostly adhere for simplicity), a(x) 
is interpreted as the spin of an elementary magnet at x. In a lattice gas interpretation, 
there is a distinguished vacuum state 6 S representing the absence of any particle, 
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and the remaining elements correspond to the types and/or the number of the particles 
at x. Unless stated otherwise, we will always assume that S is finite. Elements of S will 
typically be denoted by a, b, . . .. 

A configuration is a function a : C — > S which assigns to each vertex x G C a spin 
value a(x) G S. In other words, a configuration a is an element of the product space 
$7 = S c . Q is called the configuration space and its elements are in general written as 

c, 77, £, (It is sometimes useful to visualize a,b . . . as colors. A configuration is then 

a coloring of the lattice.) A configuration a is constant if for some a £ S, o~(x) = a for 
all x G C Two configurations a and rj are said to agree on a region A C £, written 
as "cr = i] on A", if a(x) = rj(x) for all x G A. Similarly, we write "<r = 77 off A" if 
<r(x) = 7/(a;) for all x G" A. 

We also consider configurations in regions A C C. These are elements of S" A , again 
denoted by letters like er, 77, £, . . .. Given <r, 77 G Jl, we write a\r]\c for the configuration 
£ G Q. with £(x) = <j(x) for x G A and = 77(x) for x G A c . Then, obviously, £ = a 
on A. The cylinder sets 

M K (o) = {£ G n : £ = a on A} , 

A G £, form a countable neighborhood basis of ff 6 O; they generate the product 
topology on O. Hence, two configurations are close to each other if they agree on some 
large finite region, and a diagonal-sequence argument shows that O is a compact in this 
topology. 

We will often change a configuration cr G at just one site x G C. Changing o~(x) 
into a prescribed value a G S we obtain a new configuration written a x,a . In particular, 
for S = {—1, +1} we write 




o~(y) for y ^ x 
—cr(x) for 7/ = x 



for the configuration resulting from flipping the spin at x. 

We will also deal with automorphisms of the underlying lattice (£,~). Each such 
automorphism defines a measurable transformation of the configuration space Q. The 
most interesting automorphisms are the translations of the integer lattice C = Z d ; the 
associated translation group acting on Q is given by 9 x a(y) = o~(x + y), y G Z d . In 
particular, any constant configuration is translation invariant. Similarly, we can speak 
about periodic configurations which are invariant under 9 X with x in some sublattice of 
Z d . 

Later on, we will also consider configurations which refer to the lattice bonds rather 
than the vertices. These are elements 77 of the product space {0, and a bond b G B 
will be called open if 77(6) = 1, and otherwise closed. The above notations apply to this 
situation as well. 

2.3 Observables 

An observable is a real function on the configuration space which may be thought 
of as the numerical outcome of some physical measurement. Mathematically, it is a 
measurable real function on fl Here, the natural underlying cr-field of measurable 
events in Q is the product cr-algebra T = (J 7 o) c , where is the set of all subsets of 
S. T is defined as the smallest cr-algebra on f2 for which all projections X(x) : Q, — > S, 
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X(x)(a) = a(x) with a G fl and x G C, are measurable. It coincides, in fact, with the 
Borel a- algebra for the product topology on Q. 

We also consider events and observables depending only on some region A C C. We 
let J- a denote the smallest sub-cr-field of T containing the events A/a(c) for a G S A 
and A G £ with A C A. Equivalently, T\ is the er-algebra generated by the projections 
X(x) with x G A. J^a is the a-algebra of events occurring in A. 

An event A is called local if it occurs in some finite region, which means that A G J-\ 
for some A G £ . Similarly, an observable / : O — > R is called Zoca/ if it depends on only 
finitely many spins, meaning that / is measurable with respect to Ts. for some A G £ . 
More generally, an observable / is called quasilocal if it is (uniformly) continuous, i.e., 
if for all e > there is some A G £ such that \f(a) — < e whenever £ = cr on 

A. The set C(Q) of continuous observables is a Banach space for the supremum norm 
ll/H = sup CT |/(<t)| , and the local observables are dense in it. 

The local events and observables should be viewed as microscopic quantities. On 
the other side we have the macroscopic quantities which only depend on the collective 
behavior of all spins, but not on the values of any finite set of spins. They are defined 
in terms of the tail a -algebra T = f]\ e £^F\ c , which is also called the a-algebra of all 
events at infinity. Any tail event A G T and any T-measurable observable is called 
macroscopic. 

As a final piece of notation we introduce the indicator function I a of an event A; it 
takes the value 1 if the event occurs {Ia{o~) = 1 if o G A) and is zero otherwise. 

2.4 Random fields 

As the spins of the system are supposed to be random, we will consider suitable prob- 
ability measures [i on (Q, T). Each such fi is called a random field. Equivalently, the 
family X = (X(x),x G C) of random variables on the probability space {fl,J-,fi) which 
describe the spins at all sites is called a random field. 

Here are some standard notations concerning probability measures. The expectation 
of an observable / with respect to li is written as fJ-(f) = J fdfx. The probability of 
an event A is n{A) = h(Ia) = Ja ^> ana ^ we om it the set braces when A is given 
explicitly. For example, given any x G C and a G S we write /j,(X(x) = a) for fi(A) with 
A = {a G ft : a(x) = a}. Covariances are abbreviated as n(f;g) = fJ-(fg) — n(f)n(g). 

Whenever we need a topology on probability measures on Q, we shall take the weak 
topology. In this (metrizable) topology, a sequence of probability measures [i n converges 
to fx, denoted by Li n — ► fi, if Li n (A) — ► li(A) for all local events A G Uag^-^a- This 
holds if and only if /J. n (f) — ► //(/) for all local, or equivalently, all quasilocal functions 
/. In applications, Li n will often be an equilibrium state in a finite box A n tending to 
C as n — > oo, and we are interested in whether the probabilities of events occurring in 
some fixed finite volume have a well-defined thermodynamic (or bulk) limit. That is, we 
observe what happens around the origin (via the local function /) while the boundary 
of the box in which we realize the equilibrium state receeds to infinity. As there are 
only countably many local events, one can easily see by a diagonal-sequence argument 
that the set of all probability measures on ft is compact in the weak topology. 

2.5 The Hamiltonian 

We will be concerned with systems of interacting spins. As usual, the interaction is 
described by a Hamiltonian. As the spins are located at the sites of a graph (£, ~), it is 
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natural to consider the case of homogeneous neighbor potentials. (We will deviate from 
homogeneity in Section ^ when considering random interactions.) The Hamiltonian H 
then takes the form 

H{a) = J2 U(a(x),a(y)) + ]T V{a{x)) (1) 

x~y x 

with a symmetric function U : S x S — » RU{oo}, the neighbor-interaction, and a 
self-energy V : S — > R. The infinite sums are formal; the summation index x ~ y 
means that the sum extends over all bonds (xy) € B of the lattice. f7 thus describes the 
interaction between spins at neighboring sites, while V might come from the action of 
an external magnetic field. In a lattice gas interpretation when S = {0, 1} (the value 1 
being assigned to sites which are occupied by a particle), V corresponds to a chemical 
potential. 

To make sense of the formal sums in (|l]) we compare the Hamiltonian for two dif- 
ferent configurations a,n G f2 which differ only locally (or are "local perturbations" or 
"excitations" of each other), in that a = rj off some A G £ . For such configurations we 
can define the relative Hamiltonian 

H(a\ V ) = J2 [U(a(x),a(y))-U(r ] (x), V (y))]+Y / [V(a(x)) - V( V (x))] (2) 

x^y x 

in which the sums now contain only finitely man non-zero terms: the first part is over 
those neighbor pairs (xy) for which at least one of the sites belongs to A, and the second 
part is over all x G A. 

2.6 Gibbs measures 

Gibbs measures are random fields which describe our physical spin system when it is 
in macroscopic equilibrium with respect to the given microscopic interaction at a fixed 
temperature. Here, macroscopic equilibrium means that all parts of the system are in 
equilibrium with their exterior relative to the prescribed interaction and temperature. 
So it is natural to define Gibbs measures in terms of conditional probabilities. 

Definition 2.1 A probability measure fi on the configuration space SI is called a Gibbs 
measure for the Hamiltonian H in (Q) or ^) at inverse temperature f3 ~ 1/T if for all 
A £ £ and all a G £1, 

H{X = a on A | X = rj off A) = Mft A ( (7 ) ( 3 ) 

for [i-almost all r/ £ Q. In the above, a( <7 ) * s ^ e Boltzmann— Gibbs distribution 

in A for j3 and H , which is given by 

w = I{ z2Zi) exp[ -^ ((T|7?)] • (4) 

Here, Z\(f3,rj) is a normalization constant making fj/l ^ a probability measure, and the 
constraint that a has to coincide with n outside A is added because we want to realize 
these probability measures immediately on the infinite lattice. Note that \ii A in fact 
only depends on the restriction of rj to A c . 
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So, /x is a Gibbs measure if it has prescribed conditional distributions inside some finite 
set of vertices, given that the configuration is held fixed outside, and these conditional 
distributions are given by the usual Boltzmann-Gibbs formalism. This definition goes 



back to the work of Dobrushin [64| and Lanford and Ruelle [151] in the late 1960's, 
whence Gibbs measures are often called DLR-states. By this work, equilibrium sta- 
tistical physics and the study of phase transitions made firm contact with probability 
theory and the study of random fields. A thermodynamic justification of this definition 
can be given by the variational principle, which states that (in the case C = 7i d ) the 
translation invariant Gibbs measures are precisely those translation invariant random 



fields which minimize the free energy density, cf. |151| , |96| . For better distinction, the 
Gibbs distributions fjJl A are often called finite volume Gibbs distributions, whereas the 
Gibbs measures are sometimes specified as infinite volume Gibbs measures. 

We write G(f3H) for the set of all Gibbs measures with given Hamiltonian H and 
inverse temperature (3. In the special case U = of no interaction, there is only one 
Gibbs measure, namely the product measure with one-site marginals fj,(X(x) = a) 
= e~^ v ( a ^ / J2bcs e ~^ V ^ ■ I n general, several Gibbs measures for the same interaction 
and temperature can coexist. This is the fundamental phenomenon of nonuniqueness of 



phases which is one of our main subjects; we return to this point in Section 2.7 below. 

First we want to emphasize an important consequence of our assumption that the 
underlying interaction U involves only neighbor spins. Due to this assumption, the 
Gibbs distribution \il A only depends on the restriction t]q\ of rj to the boundary <9A of 
A, and this implies that each Gibbs measure \i G Q{f3H) is a Markov random field. By 
definition, this means that for each A G £ and a G 

fi(X = a on A | Fa c ) = (J>(X = a on A | Tqa), (5) 

fi— almost surely. This Markov property will be an essential tool in the geometric 
arguments to be discussed in this review. There is in fact an equivalence between 
Markov random fields and Gibbs measures for nearest neighbor potentials, see e.g. 



Averintsev [17], Grimmett [107] or Georgii [|9q [. 

As an aside, let us comment on the case when the interaction of spins is not nearest- 
neighbor but only decays sufficiently fast with their distance. The Boltzmann-Gibbs 
distributions in (|j), and therefore also Gibbs measures, can then still be defined, but 
the Gibbs measures fail to possess the Markov property (0). Rather their local condi- 
tional distributions fil A satisfy a weakening of the Markov property called quasilocality 
or almost-Markov property: for every A G £ and A G Fa, /il a (A) is a continuous 
function of n. So, in this case, Gibbs measures have prescribed continuous versions of 
their local conditional probabilities. To obtain a sufficiently general definition of Gibbs 
measures including this and other cases, one introduces the concept of a specification 
G = (Ga, A G £). This is a family of probability kernels Ga from (Q,^ r A c ) to (fi, F). 
Ga(',v) stands for any distribution of spins with fixed configuration t/a c & outside 
A; the standard case is the Gibbs specification G\(-,rj) = fil A . A Gibbs measure is then 
a probability measure fi on fi satisfying fJ,(A\ Ta c ) = Ga(A,-) /i-almost surely for all 
A G £ and A G T; this property can be expressed in a condensed form by the invariance 
equation [iGa = /x. In order for this definition to make sense the specification G needs 
to satisfy a natural compatibility condition for pairs of volumes A C A' expressing the 
fact that if the system in A' is in equilibrium with its exterior, then the subsystem in A 
is also in equilibrium with its own exterior. It is easy to see that the Gibbs distributions 
in (^j are compatible in this sense. Details and further discussion can be found in many 



10 



books and articles dealing with mathematical results in equilibrium statistical mechan- 
ics, including [135, [206| , 96, 195, 76, |213| . In |160(| , the relation between Gibbs measures 



and the condition of detailed balance (reversibility) in certain stochastic dynamics is 
explained. 

Finally, we mention an alternative and constructive approach to the concept of Gibbs 
measures. Starting from the finite- volume Gibbs distributions fil A , one might ask what 
kind of limits could be obtained if r\ is randomly chosen and A increases to the whole 
lattice C (This slightly older but still important approach was suggested by Minlos 
[]179[| .) To make this precise we consider the measures = J fi^ A p(dr]), where p 
is any probability measure on Q describing a "stochastic boundary condition". Any 
such p P p A is called a (finite volume) Gibbs distribution with respect to H at inverse 
temperature @, and their collection is denoted by Q\{j3H). The set of all (infinite 
volume) Gibbs measures is then equal to 

G(J3H) = p| Qk{(3H). 

Equivalently, a probability measure p on f2 is a Gibbs measure for the Hamiltonian (5H 
if it belongs to the closed convex hull of the set of limit points of \il A as A j C. 

One important consequence is that Q((3H) ^ 0. This is because each Q^{(5H) is 
obviously non-empty and compact. Equivalently, to obtain an infinite volume Gibbs 
measure one can fix a particular configuration n and take it as boundary condition. 
By compactness, we obtain an infinite volume Gibbs measure p^ by taking the limit 



of (||) as A I C, at least along suitable subsequences; for details see e.g. Preston [195 ] 



or Georgii [96|. We remark that in general there is no unique limiting measure fil; 
rather there may be several such limiting measures obtained as limits along different 
subsequences. Fortunately, however, this is not the case for a wide class of models, 
either at low temperatures {(3 large) when r\ is a ground state configuration (in the 
realm of the Pirogov-Sinai theory), or at high temperatures when (3 is small. 

We conclude this subsection with a general remark. As all systems in nature are 
finite, one may wonder why we consider here systems with infinitely many constituents. 
The answer is that sharp results for bulk quantities can only be obtained when we 
make the idealization to an infinite system. The thermodynamic limit eliminates finite 
size effects (which are always present but which are not always relevant for certain 
phenomena) and it is only in the thermodynamic limit of infinite volume that we can 
get a clean and precise picture of realistic phenomena such as phase transitions or phase 
coexistence. This is a consequence of the general probabilistic principle of large numbers. 
In this sense, infinite systems serve as an idealized approximation to very large finite 
systems. 

2.7 Phase transition and phases 

As pointed out above, in general there may exist several solutions jx to the DLR-equation 
(H) for given U, V and /?, which means that multiple Gibbs measures exist. The sys- 
tem can then choose between several equilibrium states. (In a dynamical theory this 
choice would depend on the past; but here we are in a pure equilibrium setting.) The 
phenomenon of non-uniqueness therefore corresponds to a phase transition. In fact, 
it is then possible to construct different Gibbs measures as infinite volume limits of 



Gibbs distributions with different choices of boundary conditions [91, p3[. Since any 
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two Gibbs measures can be distinguished by a suitable local observable, a phase tran- 
sition can be detected by looking at such a local observable which is then called an 
order parameter. Varying the external parameters such as temperature or an external 
magnetic field (which can be tuned by the experimenter via some heatbath or reservoir) 
one will observe different scenarios; these are collected in the so called phase diagram 
of the considered system. 

As we have indicated in the introduction, the phase transition phenomenon is of 
central interest in equilibrium statistical mechanics. When phase transitions occur and 
when they do not is also one of the primary questions (although we will encounter 
many others) that we will try to answer with the geometric methods to be developed 
in subsequent sections. 

If multiple Gibbs measures for a given interaction exist, the structure of the set 
Q{f3H) of all Gibbs measures becomes relevant. We only state here the most basic 
results; a detailed exposition can be found in 96], for example. The basic observation 



is that Q{(3H) is a convex set. Its extremal elements, the extremal Gibbs measures, 
have a trivial tail cr-field T (which means that all events in T have probability or 1). 
Equivalently, all macroscopic observables are almost surely constant. In addition, the 
tail triviality can be characterized by an asymptotic independence (or mixing) property. 
On the other hand, any Gibbs measure fi can be decomposed into extremal Gibbs 
measures; therefore every configuration which is typical for [x is in fact typical for some 
extremal Gibbs measure. This shows that the extremal Gibbs measures correspond to 
what one can really see in nature as far as large systems in equilibrium are concerned. 
The extremal Gibbs measures therefore correspond to the physical macrostates, whereas 
non-extremal Gibbs measures only provide a limited description when the system's 
precise state is unknown. For all these reasons, the extremal Gibbs measures are called 
(equilibrium) phases. The central subject of this review is the geometric analysis of 
their typical configurations, and thereby the analysis of the phase diagram giving the 
variation in the number and the nature of the phases as one changes various control 
parameters (coupling, temperature, external fields, etc.). 

Often it is natural to consider automorphisms of the graph (£,~). For example, if 
C = Z d we consider the translation group {0 x ) x€ z d - A homogeneous phase is then an 
extremal Gibbs measure which is also translation invariant. On the other hand, we can 
regard the extremal points of the convex set of all translation invariant Gibbs measures. 
These are ergodic, which means that they cannot be decomposed into distinct transla- 
tion invariant probability measures, and are trivial on the cr-algebra of all translation 
invariant events. However, these extremal translation invariant Gibbs measures need 
not be homogeneous phases; they are only ergodic. Yet, ergodic measures ll satisfy a 
law of large numbers: for any observable / and any sequence of increasing cubes A, 

lim — — f o X = u(f) u-almost surely. 

Hence, ergodic Gibbs measures are suitable for modelling macrostates in equilibrium if 
one limits oneself to measuring certain bulk observables or macroscopic quantities with 
additivity properties. Notice, however, that there exists a certain non-uniformity in the 
literature concerning the nomenclature. Sometimes these ergodic Gibbs measures are 
called (pure) phases. It is then argued that it might happen that two phases (as defined 
above) for a system can by no means be macroscopically distinguished (for example if 
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one is a translation of the other) . We do not wish to enter into a detailed discussion of 
these points. 
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3 Some models 



In this section we discuss briefly the phase transition behaviour of some prototypical 
examples of Gibbs systems. Although these examples are fairly standard and well- 
known to most of our readers, we need to include them here to set up the stage. They 
will be studied in detail in the later sections. An account of phase transition phenomena 



in more general lattice models can be found in many other sources, including [147, 96 



212, 213, 18C]. 



3.1 The ferromagnetic Ising model 

The Ising model was introduced in the 1920's by Wilhelm Lenz fl5£fl and his student 
Ernst Ising [137] as a simple model for magnetism and, in particular, ferromagnetic 



phase transitions. Each site x £ C can take either of two spin values, +1 ("spin up") and 
— 1 ("spin down"), so that the state space is equal to S = {—1, +1}- The Hamiltonian is 
given by ([l]) with U(a(x), a(y)) = —a(x)a(y) and V(a(x)) = —ha(x). The parameter 
ftgR describes an external field. The finite volume Gibbs distribution in a box A with 
external field h at inverse temperature (3 > with boundary condition 77 is thus the 
probability measure (jQ on Q = {— 1,+1}^ which to each a G Q, assigns probability 
proportional to 



I{cr==r) on A c } exp 



For (3 = ("infinite temperature") the spin variables are independent under /J^ p 
but as soon as f3 > the probability distribution starts to favour configurations with 
many neighbor pairs of aligned spins. This tendency becomes stronger and stronger as 
(3 increases. 

In the case h = of no external field, the model is symmetric under interchange 
of the spin values —1 and +1, so that there is an equal chance of having many pairs 
of plus spins or having many pairs of minus spins. This dichotomy gives rise to the 
following interesting behavior. Suppose that C = Z rf , d > 2. If (3 is sufficiently small 
(i.e., in the high temperature regime), the interaction is not strong enough to produce 
any long range order, so that the boundary conditions become irrelevant in the infinite 
volume limit and the Gibbs measure is uniquely determined. By ergodicity and the ± 
symmetry, the limiting fraction of plus spins will almost surely be 1 /2 under this unique 
Gibbs measure. In contrast, when (3 is sufficiently large (in the low temperature regime), 
the interaction becomes so strong that a long range order appears: the bias towards 
neighbor pairs of equal spin then implies that Gibbs measures prefer configurations with 
either a vast majority of plus spins or a vast majority of minus spins, and this preference 
even survives in the infinite volume limit. The system thus undergoes a phase transition 
which manifests itself in a non-uniqueness of Gibbs measures. Specifically, there exist 
two particular Gibbs measures fi + and fj,~, obtained as infinite volume limits with 
respective boundary conditions 77 = +1 and 77 = — 1, which can be distinguished by their 
overall density of +l's: the density is greater than 1/2 under /i + and (by symmetry) less 
than 1/2 under [i~ . This is the spontaneous magnetization phenomenon that Lenz and 
Ising were looking for but were discouraged by not finding it in one dimension. In higher 
dimensions, the uniqueness regime and the phase transition regime are separated by a 
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sharp critical value j3 c , as is summarized in the following classical theorem [187, 62, pq 



Theorem 3.1 For the ferromagnetic Ising model on the integer lattice Z d of dimension 
d > 2 at zero external field, there exists a critical inverse temperature (3 C £ (0, oo) 
(depending on d) such that for (3 < (3 C the model has a unique Gibbs measure while for 
(3 > (3 C there are multiple Gibbs measures. 

A stochastic-geometric proof of this result will be given in Section |6[ In fact, the result 
(as well as its proof) holds for any graph (£, ~) in place of Z rf , except that (3 C may 
then take the values or oo. For instance, on the one-dimensional lattice Z 1 we have 
(3 C = oo, which means that there is a unique Gibbs measure for all (3. For Z 2 , the 
critical value has been found to be (3 C = \ log (l + V%)- This calculation is a remarkable 
achievement which began with Onsager |186| in 1944. An account of various (algebraic 



and/or combinatorial) methods can be found e.g. in [|220| , 171 1. Let us also mention the 
work done in 1973 by Abraham and Martin-L6f ]l[] relating these exact computations to 
the real magnetization in the appropriate Gibbs measures; it also gives the result that 
there is a unique Gibbs measure at the critical value (3 = (3 C . A rigorous calculation of 
the critical value in higher dimensions is beyond current knowledge. It is believed that 
uniqueness holds at criticality in all dimensions d > 2, but so far this is only known for 



d = 2 and d > 4 |L1|. 



The case of a nonzero external field h ^ is less interesting, in that one finds a 
unique Gibbs measure for all (3 and d. The intuitive explanation is that for h ^ there 
is no ± symmetry which could be broken; depending on the sign of h, the system is 
forced to prefer either +l's or — l's. This comes from the fact that the magnetic field 
acts on the whole volume, whereas the influence of a boundary condition is of smaller 
order as the volume increases. In contrast, a phase transition for ft / does occur 
when Zi d is replaced by certain nonamenable graph structures for which the boundary 
of a volume is of the same order of magnitude as the volume itself (which makes them 
physically perhaps less realistic) - an example is the regular tree T^ with d > 2; we 
refer to 1 215 , 96, |140|]. A phase transition can also occur for a non-zero external field 



for the Ising model on a half-space where it is due to the so-called Basuev phenomenon 



19|, |2J. 

Because of the simplicity of its model assumptions, the standard Ising model has 
inspired a variety of techniques for analyzing interacting random fields. Its ferromagnetic 
structure suggests various monotonicity properties which can be checked by the coupling 
methods to be described in Section ||, and the assumption of neigbor interaction implies 
the spatial Markov property (|5|) which plays a fundamental role in the geometric analysis 
of typical configurations. Many techniques which were developed on this testing ground 
turned out to be fruitful also in more general cases. 

3.2 The antiferromagnetic Ising model 

The Ising antiferromagnet is defined quite similarly to the ferromagnetic case, except 
that U(a(x),cr(y)) is taken to be +a(x) a(y) rather than —a(x)a{y). This means that 
neighboring sites now prefer to take opposite spins. 

Suppose that h = and that the underlying graph is bipartite; this means that C 
can be partitioned into two sets jO even and £ dd such that sites in C even only have edges 
to sites in C Q dd-> an d vice versa. Clearly, Z rf is an example of a bipartite graph. In this 
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situation, we can reduce the antiferromagnetic Ising model to the ferromagnetic case by 
a simple spin-flipping trick: The bijection a <-> a of Q, defined by 

cr ( x ) = ^ / \ -f ^ r ( 6 ) 
[ -cr(a;) if x € £ <m 

maps any Gibbs measure for the antiferromagnetic Ising model to a Gibbs measure 
for the ferromagnetic Ising model with the same parameters, and vice versa. As a 
consequence, a phase transition in the antiferromagnetic model is equivalent to a phase 
transition in the ferromagnetic model with the same parameters. Hence, Theorem |3.1| 
immediately carries over to the antiferromagnetic case. 

The model becomes more interesting (or, at least, more genuinely antiferromagnetic) 
if either h / or the graph is taken to be non-bipartite. Suppose first that h / but 
still £ = Z d . If \h\ is small and sufficiently large, we have the same picture as in 
the case h = 0: there exist two distinct phases, one having a majority of plus spins 
on the even sublattice and a majority of minus spins on the odd sublattice, the other 
one having a majority of plus spins on the odd sublattice and a majority of minus 
spins on the even sublattice. We will show this in Section S.5, Example |3.17 ; see also 



[p5| , |9q| . Note that this phase transition is somewhat different in flavor compared to 
that in the ferromagnetic Ising model: whereas in the Ising ferromagnet the phase 
transition produces a breaking of a state-space symmetry, the phase transition in the 
Ising antiferromagnet instead breaks the translation symmetry between the sublattices 
£even and C Q ^. 

To see what happens in the case of a non-bipartite graph we consider the triangular 
lattice which can be obtained by taking the usual square lattice Z 2 and adding an edge 
between each vertex x and its north-east neighbor x + (1, 1). In this case, one expects 
uniqueness when h = 0, and existence of three distinct phases when \h\ ^ is small and 
is large. Phase transitions in these models were studied in |3^, |129(|, for example. 



3.3 The Potts model 

A natural generalization of the ferromagnetic Ising model is the (ferromagnetic) Potts 



model [194], in which each spin may take q > 2 (rather than only two) different values. 



The state space is then S = {1,2, . . . ,q}, and the pair interaction is given by 

U{a{x),a(y)) = 1 - 21 {a{x)=a(y)} . 

We confine ourselves to the case of zero external field, so that V(a(x)) = 0. Taking q = 2 
and identifying the state space {1, 2} with { — 1, +1} we reobtain the ferromagnetic Ising 
model with zero external field. Just as in the latter case, the Potts interaction favours 



configurations where many neighbor pairs agree, and Theorem 3.1 can be extended to 



the Potts model as follows, as we will show in Section 6.3. 



Theorem 3.2 For the q-state Potts model on Z d , d > 2, there exists a critical inverse 
temperature C £ (0, oo) (depending on d and q) such that for < C the model has a 
unique Gibbs measure while for > C there exist q mutually singular Gibbs measures. 



In the same way as Theorem p.l\ this theorem also holds on general graphs provided 
we allow 0c to be or oo. The Potts model differs from the Ising model in that, for 
q large enough, there are multiple Gibbs measures also at the critical value = C , 
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as demonstrated by Kotecky and Shlosman | 148 |; an outline of a proof will be given 



in Example 8.21| . The Onsager critical value for the two-dimensional Ising model is 



believed to extend to the Potts model on Z 2 through the formula (3 c (q) = \ log(l + y/o)', 



see Welsh [ p21| , for example. This has so far only been established when q is sufficiently 
large p49|. 



3.4 The hard-core lattice gas model 

The hard-core lattice gas model (or hard-core model for short) describes a gas of particles 
which can only sit on the lattice sites but are so large that adjacent sites cannot be 
occupied simultaneously. The state space is S = {0, 1}, the pair interaction 



U(a(x),a(y)) 



oo if a(x) = a(y) = 1 
otherwise, 



describes the hard core of the particles, and the chemical potential is V(cr(x)) = 
— (log A) a{x). Here A > is the so-called activity parameter. The hard-core model 
shows some similarities to the Ising antiferromagnet in an external field and can, in 
fact, obtained from it by a limiting procedure {j3 — * oo, h — > 2d, (3(h — 2d) = const, 
|p8]| ). Since U is either or oo, the inverse temperature j3 is irrelevant and will thus 
be fixed as 1, and we can vary only the parameter A. Finite volume Gibbs distribitions 
can then be thought of as first letting all spins be independent, taking values and 1 
with respective probabilities ^tpj and jt^, and then conditioning on the event that no 
two l's sit next to each other anywhere on the lattice. 

The phase transition behavior of the hard-core model on Z rf , d > 2, is as follows. 
For A sufficiently close to 0, the particles are spread out rather sparsely on the lattice, 
and we get a unique Gibbs measure, just as in the Ising antiferromagnet at high tem- 
peratures. When A increases, the particle density also increases, and the system finally 
starts looking for optimal packings of particles. There are two such optimal packings, 
one where all sites in C even are occupied and those in £ dd are empty, and one vice versa; 
we denote these configurations by i]even 

and rjodd, respectively. (These chessboard con- 
figurations look similar to those favoured in the Ising antiferromagnet.) For sufficiently 
large A, the infinite volume construction of Gibbs measures with these two choices of 
boundary condition produces different Gibbs measures, so we get a phase transition 
351. 



Theorem 3.3 For the hard-core model on Zi d , d > 2, there exist two constants < 
A c < A' c < oo (depending on d) such that for A < A c the model has a unique Gibbs 
measure while for A > X' c there are multiple Gibbs measures. 



This result will be proved in Section 6.7. From a computer-assisted proof [201] we know 



that A c > 1.50762. It is widely believed that one should be able to take A c = A^, in this 
result, which would mean that the occurrence of phase transition is increasing in A. Such 



a result, however, would (unlike Theorems 3.1 and 3.2) not extend to arbitrary graph 



structures; some counterexamples were recently provided by Brightwell, Haggstrom and 
Winkler [pEfl . 

The hard-core model analogue of introducing an external field in the Ising model on 
Zi d is obtained by replacing the single activity parameter A by two different activities 
Xeven and X ddi one f° r sit es in C even and the other for sites in £ dd- By analogy with the 
Ising model, one would expect to have a unique Gibbs measure as soon as X even ^ X dd', 
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this was conjectured by Van den Berg and Steif [27| and proved for the case d = 2 by 
Haggstrom [117|. 



3.5 The Widom-Rowlinson lattice model 

The Widom-Rowlinson model is another lattice gas model, where this time there are two 
types of particles, and two particles are allowed to sit on neighboring sites only if they 
are of the same type. Actually, Widom and Rowlinson |223f| originally introduced it as 



a continuum model of particles living in R d ; see Section 10.2 below. The lattice variant 



described here was first studied by Lebowitz and Gallavotti |153|| . The state space is 
S = {— 1,0, +1}, where —1 and +1 are the two particle types, and O's correspond to 
empty sites. The pair interaction is given by 



U(a(x),a(y)) 
and the chemical potential by 



oo if a(x) a(y) = — 1 . 
otherwise, 



- log A_ if a(x) = —1 , 
V(a(x)) = { if a(x) = 0, 
-logA + if a{x) = +1 . 

Here A_, A+ > are the activity parameters for the two particle types —1 and +1. 
As in the hard-core model, we fix the inverse tempreature (3 = 1 and only vary the 
activity parameters. Gibbs measures can then be thought of as first picking all spins 
independently, taking values —1, or +1 with probabilities proportional to A_, 1, and 
A+, and then conditioning on the event that no two particles of different type sit next to 
each other in the lattice. We are mainly interested in the symmetric case A_ = A+ = A, 
where the phase transition behavior on Zi d , d > 2 is similar to the Ising model: For A 
small, there is a unique Gibbs measure in which the overall density of plus-particles is 
almost surely equal to that of the minus-particles. For A sufficiently large, the system 
wants to pack the particles so densely that the ±1 symmetry is broken. As for the Ising 
model, one can construct two particular Gibbs measures fi + and fj,~ using boundary 
conditions rj = +1 and rj = — 1; for small A we get [i+ = whereas for large A the 
two measures are different (and distinguishable through the densities of the two particle 
types), producing a phase transition. 

Theorem 3.4 For the Widom-Rowlinson model on Z d , d > 2, with activities A_ = 
A + = A, there exist < X c < X' c < oo (depending on d) such that for A < A c the model 
has a unique Gibbs measure while for X > X' c there are multiple Gibbs measures. 

As in the hard-core model, we expect that one should be able to take A c = X' c , but 
such a monotonicity is not known. Examples of graph structures where the desired 
monotonicity fails can be found in |p8| . 

We furthermore expect that the asymmetric Widom-Rowlinson model on Z rf with 
A_ 7^ A+ always has a unique Gibbs measure (similarly to the Ising model with a 
nonzero external field), but this also is not rigorously known. 
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4 Coupling and stochastic domination 



Geometry alone will not be sufficient for our analysis of equilibrium phases. We also need 
some probabilistic tools which allow us to compare different configurations and different 
probability measures. So we need to include another preparatory section describing 
these tools and their basic applications to our setting. 

Coupling is a probabilistic technique which has turned out to be immensely useful in 
virtually all areas of probability theory, and especially in its applications to statistical 
mechanics. The basic idea is to define two (or more) stochastic processes jointly on 
the same probability space so that they can be compared realizationwise. This direct 
comparison often leads to conclusions which would not be easily available by considering 
the processes separately. Although an independent coupling is sometimes already quite 



useful (as we will see in Section |7.3| , for example), it is usually more efficient to introduce 
a dependence which relates the two processes in an efficient way. One such particularly 
nice relationship is that one process is pointwise smaller than the other in some partial 
order. This case is related to the central concept of stochastic domination, via Strassen's 
Theorem (Theorem |4.6| ) below. We will confine ourselves to those parts of coupling 
theory that are needed for our applications; a more general account can for example be 
found in the monograph by Lindvall 1 162] . 



4.1 The coupling inequality 

In this and the next subsection of general character, C will be an arbitrary finite or 
countably infinite set. As the notation indicates, we think of the standard case that C 



is the lattice introduced in Section |2.1| , but the following results will also be applied to 
the case when C is replaced by its set B of bonds. We consider again the product space 
Q, = S , where for the moment S is an arbitrary measurable space. 

Suppose X and X' are random elements of fi, and let fi and // be their respective 
distributions. We define the (half) total variation distance — fi'\\ between [i and // 
by 

= supK^)-M'(^)l (7) 

where A ranges over all measurable subsets of The coupling inequality below provides 
us with a convenient upper bound on this distance. To state it we first need to define 
what we mean by a coupling of X and X'. 

Definition 4.1 A coupling P of two Vt-valued random variables X and X' , or of their 
distributions fj, and fj/ 7 is a probability measure on Q x £1 having marginals [i and //, in 
that for every event A C SI 

P{(£,?):£eA)=n(A) (8) 

and 

P((Z,0--Z'£A) = n'(A). (9) 

We think of a coupling as a redefinition of the random variables X and X' on a new 
common probability space such that their distributions are preserved. Sometimes it will 
be convenient to keep the underlying probability space implicit, but in general, as in 
(||) and (|9|), we make the canonical choice, which is the product space Q x Q; X and X' 
are then simply the projections on the two coordinate spaces. With this in mind, we 
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write P(X G A) and P(X' G A) for the left hand sides of (JsJ) and (§), respectively. In 
the same spirit, P(X = X') is a short-hand for -P((£,£') : £ = £,')■ 

Proposition 4.2 (The coupling inequality) Let P be a coupling of two Q-valued 
random variables X and X' , with distributions fx and fi! . Then 

Ha*- a*'II < P{X ^X'). (10) 

Proof: For any A G f2, we have 

/x(A)-^'(A) = P(le4)-P(l'6i) 

= P(A~ £ A, X' ^ A) — P(X A,X' £ A) 

< P(X£A,X'^A) 

< P(X^X'), 

whence ( |l0| ) follows by symmetry. □ 

The next result states that there always exists some coupling which achieves equality 
in (10). We call such a coupling optimal. 

Definition 4.3 A coupling P of two tt-valued random variables X and X' , with distri- 
butions fx and fi! ' , is said to be an optimal coupling if 

Wim-h'W =p(x±x'). 



Proposition 4.4 For any two Q-valued random variables X and X' , there exists an 
optimal coupling of X and X' . 



To construct an optimal coupling, one simply puts the common mass fx A fx' of fx and 
fx' on the diagonal of x fl and adds any measure with marginals fx — fx A fx' and 
fx' — fx A /j,'; the simplest choice of such a measure is the appropriately scaled product 
measure. For details we refer to [162], where such a coupling is called the 7-coupling of 
fx and fx' . This construction shows, in particular, that the optimal coupling is in general 
not unique. Applications of optimal couplings to interacting particle systems can be 
found in fll60j, |16q], for example. 



4.2 Stochastic domination 

Suppose now that S is a closed subset of R, so that 5 is linearly ordered. The product 
space is then equipped with a natural partial order ^ which is defined coordinatewise: 
For £ fi, we write £ ■< £' (or £' >z £) if £(x) < £'(x) for every x £ C. A function 
/ : fi —* R is said to be increasing (or, non-decreasing) if /(£) < /(£') whenever 
£ ■< An event ^4 is said to be increasing if its indicator function I a is increasing. The 
following standard definition of stochastic domination expresses the fact that // prefers 
larger elements of fi than fx. 

Definition 4.5 Let fx and ft' be two probability measures on fi. We say that fx is 
stochastically dominated by fx' , or fx' is stochastically larger than fx, writing fx <d ft-', 
if for every bounded increasing observable f : fi — ► R we have fx(f) < fx'(f). 
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In the one-dimensional case when \C\ = 1 and Q = S C R, this definition is equivalent 
to the classical requirement that /i([r, oo)) < fjf([r, oo)) for all r £ R. The following 
fundamental result of Strassen | J216[ | characterizes stochastic domination in coupling 
terms. 

Theorem 4.6 (Strassen) For any two probability measures fj, and // on £1, the fol- 
lowing statements are equivalent. 

(i) fi -<v n' 

(ii) For all continuous bounded increasing functions f : fi — ► R, < //(/). 

(iii) There exists a coupling P of fi and fj/ such that P(X ■< X') = 1. 

Sketch of proof: While the implications (i) (ii) and (iii) =>■ (i) are trivial, the 
assertion (ii) (iii) is too deep to be explained here in detail. To start one should note 
that n and // may be considered as measures on the compact space [— oo, oo] £ , and one 



can then follow the arguments outlined in |160||, pp. 72 ff., or [162]. Further discussion 



of Strassen's theorem can be found in 162 , 141]. □ 



The equivalence (i) 44> (ii) in Theorem [O] readily implies the following corollary. 

Corollary 4.7 The relation -<d of stochastic domination is preserved under weak lim- 
its. 

Next we recall a famous sufficient condition for stochastic domination. This condition is 



(essentially) due to Holley [134] and refers to the finite-dimensional case when \C\ < oo. 
We also assume for simplicity that S C R is finite. Hence f2 is finite. In this case, 
a probability measure \x on ft is called irreducible if the set {n £ £2 : fx(r]) > 0} is 
connected in the sense that any element of f2 with positive /^-probability can be reached 
from any other via successive coordinate changes without passing through elements with 
zero //-probability. 

Theorem 4.8 (Holley) Let C be finite, and let S be a finite subset of R. Let fi and 

// be probability measures on f2. Assume that // is irreducible and assigns positive 
probability to the maximal element of £1 (with respect to ■<). Suppose further that 

n(X{x) >a\X = £oftx)< y!{X{x) > a | X = 77 off x) (11) 

whenever x £ C, a £ S , and £, rj £ SAW are such that £ ^77, fx{X = £ off x) > and 
f/(X = rj off x) > 0. Then fj, // . 

Proof: Consider a Markov chain (X^^Lq with state space f2 and transition probabilities 
defined as follows. At each integer time k > 1, pick a random site x £ C according to 
the uniform distribution. Let X^ = Xk-i on £ \ {x}, and select Xk(x) according to the 
single-site conditional distribution prescribed by fi. This is a so-called Gibbs sampler 
for fj,, and it is immediate that if the initial configuration Xq is chosen according to fj,, 
then Xfc has distribution [i for each k. Define a similar Markov chain (XI) ^L with // 
replaced by //. 

Next, define a coupling of (X^^Lq and (X' k )^L Q as follows. First pick the initial 
values (Xq,Xq) according to the product measure fi x fj, 1 '. Then, for each k, pick a site 
x £ C at random and let Uf. be an independent random variable, uniformly distributed 
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on the interval [0, 1]. Let X^{y) = X^i(y) and X' k {y) = X' k _ 1 (y) for each site y ^ x, 
and update the values at site x by letting 

Xk(x) = max{a G 5* : /i(X(x) > a \ X = £ off x) > £/&} 

and 

A^(x) = max{a G 5 : n'{X'{x) > a\ X' = V ott x) > U k } 

where £ = X&_i(£ \ {x}) and r] = X' k _ l (C\ {x}). It is clear that this construction gives 
the correct marginal behaviors of (X^^Lq an< ^ {^'k)T=Q- The assumption (|IT|) implies 
that Xfc ^ X(, whenever Xfc_i < X' k _ v By the irreducibility of //, the chain (X£,)^l 
will almost surely hit the maximal state of O at some finite (random) time, and from 
this time on we will thus have X k if* X' k . Since the coupled chain (X k , XCj^Q is a finite 
state aperiodic Markov chain, (X k ,X' k ) has a limiting distribution as k — > oo. Picking 
(X, X') according to this limiting distribution gives a coupling of X and X' such that 
X < X' almost surely, whence [i ■<■£> // by Theorem 4.6, □ 



A non-dynamical proof of Honey's inequality by induction on |£|, together with an 
extension to non-finite S, was given by Preston J197]]; the simplest induction proof of 



an even more general result can be found in [21]. 

As a consequence of Holley's inequality we obtain the celebrated FKG inequality 
(Theorem 4,11| below) of Fortuin, Kasteleyn and Ginibre ||S3| ], who stated it under 



slightly different conditions. It concerns the correlation structure in a single probability 
measure rather than a comparison between two probability measures. 

Definition 4.9 A probability measure fj, on Q is called monotone if 

H{X{ X ) > a | X = £ off x) < n(X(x) >a\X = r]oSx) (12) 

whenever x G C, a G S , and £, r\ G are such that £ ^ rj, fi(X = £ off x) > and 

H(X = r) off x) > 0. 

Intuitively, [i is monotone if the spin at a site x prefers to take large values whenever 
its surrounding sites do. 

Definition 4.10 A probability measure fi on Q is said to have positive correlations 

if for all bounded increasing functions f,g: 12 — > R we have 

M/<?)>K/Ms)- 

Since the preceding inequality is preserved under rescaling and addition of constants 
to / and g, [i has positive correlations whenever /x -<x> 1^' for any probability measure 



fj,' with bounded increasing Radon-Nikodym density relative to [i. Theorem 4.6 thus 
shows that \x has positive correlations whenever /J,(fg) > //(/) fJ>(g) for all continuous 
bounded increasing functions / and g. Hence, the property of positive correlations is 
also preserved under weak limits. 

Theorem 4.11 (The FKG inequality) Let C be finite, S a finite subset of R, and 
fj, a probability measure on fi which is irreducible and assigns positive probability to the 
maximal element of£l (relative to ^). If fi is monotone, it also has positive correlations. 
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Proof: Suppose fjf is a second probability measure on fl such that n'(rj) = fj,(r])g(r]) 
for all rj G and some positive increasing function g. For x G £, a G S and £ G gAw 
such that /u(A = £ off x) > we write q x (a, £) = [i{X(x) > a\X = £ o& x) and define 
9x( a ;0 similarly in terms of // . Then 



£(a,o/(W>,0) = E#* s )5r' s )/EMe'' s )5(r' s ) 

s>a ' s<a 

> E#* s )/EMr- s ) 



s>a ' s<a 

Qx(a,0 I (1 - fefa, £))• 



Together with assumption (^), this implies that \i and // satisfy ([ll]). Theorem 4.8 



thus implies that /i :<x> //, and the corollary follows. □ 

Finally we state a simple observation showing that, under the condition of stochastic 
domination, the equality of the single-site marginal distributions already implies the 
equality of the whole probability measures. 

Proposition 4.12 Let C be finite or countable, and let /i and fj! be two probability 
measures on ft = R £ satisfying /i <d ■ If, in addition, n{X(x) < r) = fi'(X(x) < r) 
for all x £ £ and r G R then jj, = jj, 1 '. 

Proof: Let P be a coupling of ^ and [i! such that P{X ^ X') = 1 which exists by 



Theorem 4.6. Writing Q for the set of rational numbers, we have for each x G C 
P(X(x) / X'{x)) = P{X{x) < X'(x)) 

< e p ( x ( x ) ^ r > x '( x ) > r ) 



reQ 



E (p{X(x) < r) - P(X'(x) < r)\ 



reQ 

= 0. 

Summing over all x G C we get P(X ^ X') = 0, whence ji = \j! by (|lO|). □ 



4.3 Applications to the Ising model 

We will now apply the results of the previous subsection to the ferromagnetic Ising 
model. Let (£, ~) be any infinite locally finite graph. For definiteness, one may think 
of the case C = Z d ; the arguments are, however, independent of the particular graph 
structure. 

As in Section 3.1, we write ^ p ^ for the Gibbs distribution in a finite region A 



with boundary condition r\ G Q and external field h G R at inverse temperature (3 > 
0. Our first application of Honey's theorem asserts that if one boundary condition 
dominates another, then we also have stochastic domination between the corresponding 
finite volume Gibbs distributions. 



Lemma 4.13 // the boundary conditions satisfy £ X n, then 

r4,/3,A - v ^h,/3,A ■ 
Also, each /^i /3 a ^ as P os iti ve correlations. 
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Proof: The conditional probability of having a plus spin at a given site x given the 
configuration £ everywhere else is equal to (1 + exp[— 2(3(h + J2y-. y ~x £(y))]) -1 > which 



is an increasing function of £. Theorem [4.8| and Theorem [4,11 thus imply stochastic 
domination between the projections of the Gibbs distributions to 5 A and the positive 
correlations property. As their behavior outside A is deterministic, the lemma follows. 
□ 

We write /J^g^ and ^ or ^ ne ^ n ^ e volume Gibbs distributions obtained with 



respective boundary conditions rj = +1 and n = —1. Lemma 4.13 then shows that 



Mfc,/3,A Mfc,/9,A A*J/?,A ( 13 ) 

for any 77 € O. This sandwich inequality reveals the special role played by the "all 
plus" and "all minus" boundary conditions. Next we establish the existence of the 
limiting "plus measure" discussed in Section |3.l| . We will say that a measure /i on Q 
is homogeneous if it is invariant under all graph automorphisms of (£, ~). In the case 
C = Z d , a homogeneous measure is thus invariant under translations, lattice rotations, 
and reflections in the axes. We write A f C for the limit along an arbitrary increasing 
sequence of finite regions which exhaust the full graph C. 

Proposition 4.14 The limiting probability measure 

exists, fifr p is a homogeneous Gibbs measure for the Ising model on (£, ~) with external 
field h and inverse temperature (3 and has positive correlations. 



Proof: By the general theory in Section |2.6| , the limit is a Gibbs measure whenever 



it exists. Also, by Lemma 4.13 the limit must have positive correlations. To show the 



existence of the limit we note that 

HhfiA ~ v M >Ua whenever A C A . (14) 



This follows from Lemma 4.13 because /i^g A is obtained from fi^ s A by conditioning 
on the increasing event that X = +1 on A \ A. 

Now, for any finite A C A, if A increases, then, by (|14|), /j^ a (A = +1 on A) 
decreases and therefore converges to inf a ^(X = +1 on A) . Note that this limit is 
obviously invariant under any automorphism of (£, ~). By inclusion-exclusion it follows 
that, for any local observable /, \x^a \{f) converges to an automorphism invariant limit 
as A I C. These limits determine a unique homogeneous probability measure 3 which, 
as a weak limit of finite volume Gibbs distributions, is a Gibbs measure. The lemma is 
thus proved. □ 

Obviously, replacing the "all plus" boundary condition by the "all minus" boundary 
condition, we obtain in the same way an automorphism invariant infinite volume Gibbs 
measure fx^ g with positive correlations. In the same way, Lemma 4.13j shows that 



any extremal Gibbs measure has positive correlations (since it is a weak limit of 3 A 
for suitable rj). However, positive correlations may fail for suitable (£,~) and some 
particular non-extremal Gibbs measures. For instance, when C = Z 3 and (3 is sufficiently 
large, one can take a convex combination of two different so-called Dobrushin states; 
see [|66|, E2f . 
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We now take the limit in the sandwich inequality (|13|). Let // be any Gibbs measure 



for the Ising model with parameters h and j3. Taking the mean J n(drj) in (13), we 
obtain that /^aa div I 1 Tiv r a> an d since stochastic domination is preserved under 
weak limits, we end up with 

H~h,p —V V <v l4 tj3 (15) 
when [i is any Ising-model Gibbs measure for (3, h. On the one hand, this shows that 



\i h a and fj^p are extremal, and thus equilibrium phases in the sense of Section |2J. On 



the other hand, we obtain an efficient criterion for the existence of a phase transition 



which was first observed by Lebowitz and Martin-L6f [155] and Ruelle 205 1 



Theorem 4.15 For the Ising model on an infinite locally finite graph (£, ~) with ex- 
ternal field h 6 R and inverse temperature (5, the following statements are equivalent. 

(i) There is a unique infinite volume Gibbs measure. 

(ii) H,p = 

(iii) Hl (X{x) = +1) = nt )f) (X(x) = +1) for all x e C. 

Proof: The implications (i) => (ii) =>■ (iii) are immediate, (iii) (ii) follows directly 
from ( |i"5| ) and Proposition 4.12| , and (ii) =>• (i) from (15). □ 



Remarks: (a) In the case h = of no external field, assertion (iii) is equivalent to 
b(X{x) = +1) = 1/2 for all x G C, by the ± symmetry of the model. An extension 
of Theorem |4.15| in this case to the (/-state Potts model will be given in Theorem 6.10 



(b) If the graph automorphisms act transitively on (£,~) then, by homogeneity, 
assertion (iii) is equivalent to having the equation only for some x £ C. Using for 
example the random-cluster methods of Section ^ one can obtain the same equivalence 
also in the general case, assuming only that C is connected. 

(c) If £ = Zi d , (iii) is equivalent to the condition that the free energy density is 
differentiable with respect to h at the given values of h and j3 [155]. By the celebrated 



Lee- Yang circle theorem [|203j, this is the case whenever h ^ 0. Alternatively, one can 
use the so-called GHS inequality to establish (iii) for h ^ |196[| . Hence, for non-zero 
external field the Ising model on Z d does not exhibit a phase transition. 

We conclude this subsection comparing the Ising-model "plus" measures for different 
values of the parameters. (For similar results in a lattice gas setting sec |156[ |.) 

Proposition 4.16 Consider the Ising model on an arbitrary graph (£, ~) at two inverse 
temperatures (3\, 02 and two external fields hi, hi- Suppose that either f}\ = fa and 
hi < hi, or (£, ~) is of bounded degree N = sup xg £ N x and /32^2 > 0ihi + N\/3i — 
Then 

Proof: The stated conditions imply that fj,^ ^ {»}("'"■'■) — {x}(~^~^ whenever 

£ V- Hence, by Theorem [D^, (i£ a A <x> l±X 2 g a f° r an ^> ana - * ne proposition follows 
by letting A | C. ' □ 

If C has bounded degree N, we obtain in particular a comparison of Ising and Bernoulli 
measures. Let ifjp denote the Bernoulli measure on {—1, 1} £ with density p. Then 
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y-Xfi -> as P 

thus show that 



and 



and [3h — > ^ log j^— . The preceding proposition and Corollary [L7 



I J) 

Phft — © ^ P for /» > log + N 



1 p 

±v tp P for /» < -j log j ^ • 



2/3 
1 

2^ 



P 



4.4 Application to other models 



Do the arguments of the previous subsection extend to the other models of Section 
3? The answer to this question is different for the different models. For the Potts 



model with q > 3, Lemma 4.13 fails because the conditional probability that the spin 
at a site x takes a large value is not increasing in the surrounding spin configuration. 



Nevertheless, the Potts model admits some analogues of Proposition [4.14| and Theorem 
4.15| . These results are deeper than their Ising counterparts, and will be demonstrated 
by random-cluster arguments in Section |6[ 

The Widom-Rowlinson model exhibits the same monotonicity properties as the Ising 



model. We thus obtain Widom-Rowlinson analogues of Lemma 4.13 and Proposition 
4.14 . Fixing any two activity parameters A+, A_ > and writing fi + (resp. fi~) for 



the associated limiting Gibbs measures with "all plus" (resp. "all minus") boundary 
conditions, we find that any other Gibbs measure for the same parameters is sandwiched 



(in the sense of fllq)) between fi + and (jl . The analogue of Theorem 4.15 reads as follows. 



Theorem 4.17 For the Widom-Rowlinson model on an infinite locally finite graph 
(£,~) with activity parameters A+, A_ > 0, the following statements are equivalent. 

(i) There is a unique infinite volume Gibbs measure. 

(ii) fi+ = n~ 

(iii) n + {X{x) = +1) = p-{X{x) = +1) for all x £ £. 

The Ising antiferromagnet and the hard-core lattice gas model are far from satisfying 



the monotonicity properties needed for the arguments in Section 4.3; the conditional 
probability that a site x takes the value +1 is decreasing, rather than increasing, in the 
surrounding configuration. However, when (£, ~) is bipartite (as in the case C = Z d ) 
we can use again the trick of (|6|) to flip all spins on the odd sublattice. The Ising 
antiferromagnet is then mapped onto the Ising ferromagnet with a staggered external 
field (having alternating signs on the even and the odd sublattices) . Similarly, the 
hard-core model is mapped into a model which also exhibits the necessary monotonicity 
properties. We thus obtain analogous results which we spell out only for the hard-core 
model with activity A > : There exist two particular Gibbs measures \x e ^ en and 
obtained as infinite volume limits of finite volume Gibbs distributions with respective 



boundary conditions rj even and rj 0( id, defined in Section 3.4. In terms of these two Gibbs 



measures, the existence of a phase transition can be characterized as follows. 
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Theorem 4.18 For the hard-core model on an infinite locally finite bipartite graph 
(C, ~) with activity parameter X, the following are equivalent. 

(i) There is a unique infinite volume Gibbs measure. 

..even ..odd 

{11) fl x — fl x 

(iii) fi e x ven (X(x) = 1) = d {X(x) = 1) for all x G C. 
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5 Percolation 



We will now introduce the ideas of random geometry we referred to in the title of this 
paper. As these were developed first in the framework of percolation theory, we devote 
this section to a description of this subject. We will start with the classical case of 
independent, or Bernoulli, percolation, and will then proceed to the case of dependent 
percolation. In the subsequent sections we will see how these results and ideas can be 
used for the geometric analysis of equilibrium phases. 



5.1 Bernoulli percolation 

Bernoulli percolation was introduced in the 1950's in papers by Broadbent and Ham- 
mersley p9|, 124, 125] as a model for the passage of a fluid through a porous medium. 



In fact, the model has appeared first in [79] in the context of polymerization. We give 



here only a brief introduction; a thorough treatment can be found in the the books and 
lectures by Grimmett [105, |110| ], Chayes and Chayes [52], and Kesten [ 143 1 . 



The porous medium is modelled by a graph (£, ~), and either the sites or the bonds 
of this graph are considered to be randomly open or closed (blocked). We begin with 
the case of site percolation; the alternative case of random bonds will be discussed at 
the end of this subsection. 

The basic question of percolation theory is how a fluid can spread through the 
medium. This involves the connectivity properties of the set of open vertices. To 
describe this we introduce some terminology. A finite path is a sequence (i>o, e%, v%, e^-, 
• • • , e&, Ufc), where Vq, . . . , £ C are pairwise distinct vertices and e%, . . . , e& 6 B are 
pairwise distinct edges such that, for each i E {1, . . . , k}, the edge ei connects the vertices 
Vi-i and vi. Obviously, a path is equivalently described by its sequence (vq, v\, . . . , v^) 
of vertices or its sequence (ei, e-z-, ■ ■ ■ > e-k) of edges. The number k is called the length 
of the path. In the same way, we can also speak of infinite paths (t>o, ei, v\, 62, ■ ■ •) and 
doubly infinite paths (. . . , eo, Vq, e\,v\, . . .). A region C C C is called connected if 
for any x,y £ C there exists a path which starts at x, ends at y, and which only contains 
vertices in C. 

An open path is a path on which all vertices are open. An open cluster is a maximal 
connected set C in which all vertices are open; here maximal means that there is no 
larger region C'dC which is connected and only contains open vertices. An infinite 
open cluster (or infinite cluster, for short) is an open cluster containing infinitely many 
vertices. Using these terms, we may say that the existence of an infinite open cluster is 
equivalent to the fact that a fluid can wet a macroscopic part of the medium. 

We now turn to the classical case of Bernoulli site percolation with retention param- 
eter p G [0, 1]. In this case, each vertex of £, independently of all others, is declared to 
be open (and represented by the value 1) with probability p and closed (with value 0) 
with probability 1 — p. We write ip p for the associated (Bernoulli) probability measure 
on the configuration space {0, 1} . 

The first question to be asked is whether or not infinite clusters can exist. This 
depends, of course, on both the graph (£, ~) and the parameter p. The basic observation 
is the following. 

Proposition 5.1 For Bernoulli site percolation on an infinite locally finite graph 
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(£, ~), there exists a critical value p c £ [0, 1] such that 



{0 if T) *\ T) 
■r C 
1 lfp>p c . 

At the critical value p = p c , the ip p -probability of having an infinite open cluster is either 
or 1. 

Proof: A moment's thought reveals that the existence of infinite clusters is invariant 
under a change of the status of finitely many vertices. By Kolmogorov's zero-one law, 
the Vp-probability of having infinite clusters is therefore either or 1. It remains to 
show that this probability is increasing in p. The existence of an infinite open cluster is 
obviously an increasing event, so we are done if we can show that 

VVi ^p 2 whenever p\ < p 2 . (16) 

This is intuitively obvious and can be proved by the following elementary coupling 
argument. Let Y = (Y(x)) x&J r be a family of i.i.d. random variables with uniform 
distribution on [0,1], and for p £ [0,1] let X p = (X p (x)) x& c be defined by X p (x) = 
I{Y(x)<p}- It is then clear that X p has distribution ip p and X pi X X p2 whenever p\ < p%. 
This implies ([!(]) by (the trivial part of) Theorem L_6. (Note that we have in fact 



constructed a simultaneous coupling of all VV S > an d that this construction would be 
used in Monte Carlo simulations of Bernoulli percolation.) □ 

We write {x «-> oo} for the event that x £ C belongs to an infinite cluster, and set 
Qxip) = ^p{x *-* oo). For homogenous graphs such as £ = Z d , 9 x (p) does not depend 
on x, and then we write simply 9(p). Equation (16) shows that 6 x (p) is increasing in p. 
We also make the following observation. 

Proposition 5.2 For any infinite locally finite connected graph (£, ~), any x £ C and 
any p £ (0, 1), we have 9 x (p) > if and only if ip p (3 an infinite open cluster) = 1. 



Proof: From Proposition we know that an infinite cluster exists with probability 
or 1. The implication "only if" is therefore immediate. For the "if" part, we note that 
if an infinite cluster exists with positive probability, then there is some N such that 
ipp(Aj^) > 0, where An is the event that some vertex within distance N from x belongs 
to an infinite cluster. On the other hand we have i/j p (Bn) > 0, where Bn is the event 
that all vertices within distance N— 1 from x are open. The event AnPiBn implies that x 



belongs to an infinite cluster. But A jy and Bn are increasing events (see Section 4.2), so 



that we can apply Theorem 4.11 to obtain 9 x {p) > iJj p (An CiBn) > iP p (An)?P p (Bn) > 0. 
□ 

Note that the proof above applies to the much broader class of all measures with positive 
correlations (recall Definition 4.10| ), rather than only the Bernoulli measures. 



Next we ask whether both possibilities in Proposition 5.2 really occur, that is, if 
< p c < 1. For, only in this case we really have a nontrivial critical phenomenon at 
p c . The answer depends on the graph. For C = Z d with dimension d > 2, the threshold 
p c is indeed nontrivial, as is stated in the theorem below. This nontriviality of p c is a 
fundamental ingredient of many of the stochastic-geometric arguments employed later 
on. On the other hand, it is easy to see that p c = 1 for C = Z . 
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Theorem 5.3 The critical value p c = p c (d) for site percolation on C = Z d , d > 2, 
satisfies the inequalities 

1 <pc<t (17) 



2d- 1 - 7 

Proof: We begin with the lower bound on p c . For fc = 1, 2, . . ., we write JVjfe for the 
random number of open paths of length k starting at 0. On the event {0 <-» oo} we have 
Nk > 1 for each k, whence 

0(p) < MNk) ■ (18) 

The number of all paths of length k starting at is at most 2d(2d — l) fc_1 , and each 
path is open with probability p k . Hence 

%(N k ) < 2d(2d - if- y 

which tends to as k — > oo whenever p < l/(2d — 1). In combination with (IT3) this 



implies that 6{p) = for p < l/(2d — 1), and the first half of (17) is established. 

The second half of ([[7]) only needs to be proved for d = 2; this is because Z 2 can be 
embedded into Z d for any d > 2, so that p c (d) < £> c (2). So let d = 2. We first need some 
additional terminology. A *-path in Z 2 is a sequence (vq, v i, . . . , v k ) of distinct vertices 
such that doo(vj-ij Uj) = 1 for j = 1, ... fc. Note that two consecutive vertices in a *- 
path need not be nearest-neighbors; they may also be "diagonal neighbors". A ^-circuit 
is a sequence {vq,v\, . . . , Vk, Vo) such that (vq, v\, . . . , Vk) is a *-path and doo(vk,vo) = 1. 
Informally, a *-circuit is a *-path which ends where it starts. *-circuits with the same 
set of sites are identified. A closed ^-circuit is a *-circuit in which all vertices are closed. 

Now let M be the number of closed *-circuits that surround the origin 0. As Z 2 
is a planar graph, the "outer boundary" of a finite open cluster containing defines a 
closed *-circuit around 0. Hence, the event {0 «-> oo} occurs if and only if M = 0. 

The number of all (not necessarily closed) *-circuits of a given length k surrounding 
allows the following crude estimate. Consider the leftmost crossing of the x-axis of 
such a circuit; the location of such a crossing is at distance at most k from the origin, 
so there are at most k such locations to choose from. Starting at this location, we may 
trace the *-circuit clockwise (say), and at each step we have at most 7 doo-neighbors to 
choose from. Hence, the number of *-circuits of length k around is at most /c7 fc_1 . 
Each one is closed with probability (1 — p) k , so 

oo 

^(M)<^A;7 fc - 1 (l-p) fe . 

k=l 

The last sum is finite for p > 6/7. Hence, for such a p and n large enough, there is 
a positive probability for having no closed *-circuit around which contains a site of 



distance at least n from 0. By the argument in the proof of Proposition 5.2, it follows 
that ip p {M = 0) > for such p. Hence 9{p) > for p > 6/7, and the second half of ([l7]) 
follows. □ 

The exact value of the percolation threshold p c {d) of Z d is not known for any d > 2. 
The best rigorous bounds for d = 2 are presently 

0.556 < p c (2) < 0.680 (19) 
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where the first inequality is due to van den Berg and Ermakov |Eq| (inspired by |17Gfl ) 



and the second to Wierman [224]. For high dimensions, it is known that 



lim 2dp c {d) = 1 , (20) 



see Kesten | 144 |, Gordon [104] and Hara and Slade [127] for this and finer asymptotics. 
On some particular graphs, p c can be determined exactly. For example, for C = T^, 
the regular (Cayley or Bethe) tree, branching-process arguments immediately show that 
p c (Trf) = 1/d, and for the triangular lattice it follows from planar duality that p c = 1/2 

B. 

The preceding considerations do not tell us what happens at the critical value p c . 
It is believed that, for the integer lattice Z d of any dimension d > 2, there is if) Pc -&.s. 
no infinite cluster, which means that 9(p c ) = 0; so far this is only known for d = 2 
and d > 19, see Russo |209| and Hara and Slade |2§]. One can show that the relation 



@(Pc) = implies continuity of Q(p) at p = p c , so in combination with the trivial 
continuity of 9{p) in the subcritical regime and the following more interesting result 
(which can be found e.g. in [|108| ) we get continuity of 9(p) throughout [0, 1] as soon as 
absence of infinite clusters at criticality is established. 

Theorem 5.4 For Bernoulli site percolation on Zi d , d > 2, the function 9(p) is contin- 
uous throughout the supercritical regime (p c , 1]. 

So far we were interested in the existence of infinite clusters. In the subcritical 
regime p < p c when no infinite cluster exists, one may ask for the size of a typical 
cluster. Let |Co| be the random number of vertices in the open cluster containing the 
origin; we set | Co I = if the origin is closed. By the definition of p c , tp p (\Co\ = oo) = 0. 
For C = Z d , we even have the stronger statement that the expected value of |Co| is 
finite. 

Theorem 5.5 For Bernoulli site percolation on Z d with retention parameter p < p c , 
we have ip p (\Co\) < oo. 



This was proved independently by Menshikov [174| and by Aizenman and Barskyjq 



The proofs are rather involved, so we refer the reader to the original articles and ||10£ 



It is worth noting that Theorem 5.5 fails in the setting of general graphs; a striking 
counterexample is the "three-one-tree" discussed on p. 936 of Lyons |164| ]. 

Menshikov even showed that the distribution of the radius of the open cluster con- 
taining the origin decays exponentially. The following even stronger result states that 



the same is true for the distribution of |Cq|; see Grimmett [105] for a proof (in the case 
of bond percolation). 

Theorem 5.6 For Bernoulli site percolation on Z d with retention parameter p < p c , 
there exists a constant c ( depending on p) such that 

M\Co\>n) < e~ cn 

for all n. 

Looking at a fixed path of length n starting at 0, we immediately obtain the lower 
bound i^ p {\Cq\ > n) > p n . So the preceding upper bound is best possible, except that 
the optimal constant c is unknown. 
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We conclude this section with some remarks on Bernoulli bond percolation. The 
model is similar, except that now the edges rather than the vertices in (£, ~) are in- 
dependently open (described by the value 1) or closed (with value 0) with respective 
probabilities p and 1 — p. The associated configuration space is thus {0, 1} B . We write 
4> p for the associated (Bernoulli) probability measure on {0, In the present context 
of bond percolation, an open path is a path in which all edges are open, and an open 
cluster is a maximal region C C C which is connected, in that for any x,y G C there is 
an open path in C from x to y. 

All results for site percolation discussed so far extend to the bond percolation set- 
up. This is no surprise because bond percolation is equivalent to site percolation on 
the so-called covering graph for which B is taken as set of vertices, and edges are 
drawn between any two coincident elements of B. In particular, there exists again a 
critical value p c for the occurrence of infinite open clusters, and Propositions 5.1 and 



5.2, Theorems |5.3| , 5.5, |5.6| and 5.4, and the asymptotic formula ( J20| ) are still true in 
the case of bond percolation. What is generally different, are the critical values for 
site and bond Bernoulli percolation on a given graph. One remarkable case is that of 
bond percolation on Z 2 , where (again by planar duality) p c = 1/2; this is a famous 
result of Kesten 142|. In the specific case of trees, however, site and bond percolation 
are equivalent. In particular, for C = T^ we have p c = 1/d for both site and bond 
percolation; for more general trees a formula for p c was given by Lyons [164]. 



5.2 Dependent percolation: the role of the density 

Our main subject is the analysis of equilibrium phases by means of percolation methods. 
In this case, a site will be considered as open if, for example, the configuration in a 
neighborhood of this site shows a specified pattern, and the events "site x is open" with 
x G C are then far from being independent. This leads us to considering the case of 
dependent percolation. In this subsection we do some first steps in this direction. 

Our starting point is the following question. In the case of Bernoulli percolation, 
there is a unique parameter, the occupation probability or density p, which governs 
the phase diagram and allows to distinguish between subcritical ("no infinite clus- 
ter") and supercritical ("at least one infinite cluster") behavior. Does this also hold 
in general? Specifically, is it true that for any translation invariant probability mea- 
sure /x on {0,1} Z , the occurrence of an infinite cluster only depends on the density 
p(fi) = fi(X(x) = 1) of open sites x G Z d ? In general, the answer is obviously "no", as 
we will now show by two simple examples: there exist translation invariant measures \i 
on {0, 1} Z with arbitrarily small densities such that infinite clusters exist almost surely, 
and also translation invariant measures with densities arbitrarily close to 1 for which 
no infinite clusters exist with probability 1. 

Example 5.7 For q G (0, 1), let (Y(x), x G Z) be i.i.d. random variables taking values 
and 1 with probability 1 — q resp. q. We define a translation invariant random field 
(X(x), x G Z d ), d > 2, by setting X(x) = X(x 1 ,...,x d ) = Y(xi) for each x G Z d . 
Writing /x for the distribution of (X(x), x G Z rf ), we have p(/i) = q, but with \i- 
probability 1 there exist infinitely many infinite clusters, even if q is arbitrarily small. 



Example 5.8 Again let q G (0, 1), d > 2, and (Y(x,i), x G Z, i G {1, . . . , d}) be i.i.d. 
random variables taking values and 1 with probabilities 1 — q and q. We define a 
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translation invariant random field (X(x), x E Z rf ) by setting 

d 

X(x) = X(xi, ...,x d ) = Y[ Y(xi,i). 

i=l 

A moment's thought reveals that /i-a.s. there exist no infinite clusters, despite the fact 
that p(pL) = q d may be arbitrarily close to 1. 

These examples suggest to look for additional assumptions under which high (resp. low) 
density guarantees existence (resp. nonexistence) of infinite clusters. Positive correla- 
tions (in the sense of Definition |4.10| ) does not suffice, because both examples above 
obviously have positive correlations. 

An alternative might be to assume ^-independence in the sense that AT (A) and 
A (A) are independent for any two finite regions A, A C C such that 

min \x — y\ > R 

x£A,y£A 

for some given R. For Zi d , d > 2, this gives nontrivial thresholds < p\ < p2 < 1 (de- 
pending on R) such that existence (resp. non-existence) of infinite clusters is guaranteed 
as long as p(p) > P2 (resp. p(/i) < p\); see e.g. Liggett, Schonmann and Stacey ]161f] . 
However, /^-independence rarely holds in Gibbs models. For instance, for plus phase of 
the Ising model with vanishing external field and inverse temperature j3 > 0, the spins 
at any two vertices are always strictly positively correlated no matter how far apart 
they are (although the correlation does tend to in the distance). 

However, in contrast to what we just saw in the case of the cubic lattices, the 
density does play a significant role for the regular trees T^. To show this we consider a 
probability measure [i on {0, where now C = Td with d > 2. The natural analogue 
of translation invariance in this setting is automorphism invariance of /U, which means 
that n inherits all the symmetries of T^. In particular, this implies that [i(X(x) = 1) is 
independent of x, so that the density p(/j,) is well-defined. As opposed to the Z d case, 
having p(fi) sufficiently close to 1 now does guarantee that an infinite cluster exists with 
positive probability. This is also true in the bond percolation case, where p{n) is defined 
as the probability that a given edge is open. The following result is due to Haggstrom 

EH- 

Theorem 5.9 For any automorphism invariant site percolation model fi on T^ with 
density p([i) > 4^", we have fi(3 an infinite open cluster) > 0. The same is true for 
bond percolation on T^ with density p(fi) > ^xj- 

These bounds are in fact sharp, in that for any p < 4jfer there exists some automorphism 
invariant probability measure on {0, l} Td with density p, which does not allow an infinite 
cluster with probability 1, and similarly for the case of bond percolation; see |118| ]. It 
follows from Example |5.8| that the corresponding threshold for Z d is trivial: only density 
1 is enough to rule out the nonexistence of infinite clusters. The intuitive reason is the 
following. On Z d , one can find finite regions A C C with arbitrarily small surface-to- 
volume ratio, which means that a vast majority of sites is not adjacent to a vertex 
outside A; we can simply take A = A n = [— n, n] d n Z rf with large n; this property of 
Zi d is known as amenability. Hence, a relatively small number of closed vertices may 
easily "surround" a large number of open sites. In contrast, every region in T^ has 
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a surface of the same order of magnitude as its volume; this makes it impossible for 
a small minority of closed vertices to surround a large number of open vertices. This 
intuition can be turned into a proof using the so called mass-transport method sketched 
below. Benjamini, Lyons, Peres and Schramm |23| have recently extended this method 
to derive a similar dichotomy for a large class of graphs, including Cayley graphs of 
finitely generated groups. 



Sketch proof of Theorem 5.9: For simplicity we confine ourselves to the case of bond 
percolation on T2. We want to show that if p{n) > 2/3, then an infinite cluster exists 
with positive /i-probability. Imagine the following allocation of mass to the edges of T2. 
Originally every edge receives mass 1. Then the mass is redistributed, or transported, as 
follows. If an edge e is open and is contained in a finite open cluster, then it distributes 
all its mass equally among those closed edges that are adjacent to the open cluster 
containing e. If e is open and contained in an infinite open cluster, then it keeps its 
mass. Closed edges, finally, keep their own mass and happily accept any mass that 
open edges decide to send them. The expected mass at each edge before transport is 
obviously 1, and one can show — this is an instance of the mass-transport principle |^3| 

- that the expected mass at a given edge is 1 also after the transport. Suppose now, 
for contradiction, that p(fJ-) > 2/3 and that all open clusters are finite /i-a.s. Then all 
open edges have mass after transport. Furthermore, since each open cluster containing 
exactly n edges has exactly n + 3 adjacent closed edges (as is easily shown by induction 

- it is here that the tree structure is used), the mass after transport at a closed edge 
adjacent to two open clusters of sizes m and 712 has mass 

ni + 3 n 2 + 3 

Hence the expected mass after transport at a given edge e is strictly less than 

3fi(X(e) =0) = 3(1- P (m)) < 1, 
contradicting the mass-transport principle. □ 

5.3 Examples of dependent percolation 

^From the previous subsection the reader might get a rather pessimistic view of the 
possibilities of establishing existence (or non-existence) of infinite clusters for depen- 
dent percolation models on Z d . This is certainly not the case, and a lot can be done. 
One standard way of determining the percolation behavior of a dependent model is by 
stochastic comparison with a suitable Bernoulli percolation model: For the existence of 
infinite clusters, it is sufficient to show that the given dependent model is stochastically 
larger than the Bernoulli model for some parameter p > p c , and the absence of infinite 
clusters will follow if the model at hand is stochastically dominated by the Bernoulli 
model for some p < p c . Let us demonstrate this technique for the Ising model on Z d . 



Consider percolation of plus spins in the plus measure a, defined in Section 4.3 
If we keep fixed then Proposition 4.16 tells us that fi^ a is stochastically increasing in 



h. Consequently, both the probability of having an infinite cluster of plus spins, as well 
as the probability that a given vertex is in such an infinite cluster, are increasing in h. 
Furthermore, as Z d is of bounded degree N = 2d, the remarks after the same propo- 
sition imply that, for any given p G (0, 1) and (3, the Ising measure fj^a stochastically 
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dominates the Bernoulli measure ip p when h is large enough, and is dominated by ip„ for 



h below some bound. We may combine this observation with Proposition 5.1 to deduce 
the following critical phenomenon: 

Theorem 5.10 For the Ising model on Z d , d>2, at a fixed temperature (3, there exists 
a critical value h c G R ( depending on d and (3 ) for the external field, such that 

[i\ ^(3 an infinite cluster of plus spins) = | ^ j| ^ ^ ^ c 

As we shall see later in Theorem |8.2| , we have h c = when d > 2 and (5 > (3 C . Higuchi 
[|133|1 has shown that the percolation transition at h c is sharp, in that the connectivity 
function decays exponentially when h < h c , and that the percolation probability is 
continuous in (/3, h), except on the critical half-line h = 0, (3 > (3 C . In Section |6| below, we 
will make a similar use of stochastic comparison arguments for random-cluster measures, 



cf. Proposition 3.11. The stochastic domination approach works also in the framework 



of lattice gases with attractive potential; see Lebowitz and Penrose | 156| . 



In the rest of this subsection we shall give some examples of strongly dependent 
systems where other approaches to the question of percolation are needed. Typically in 
these examples, the probability that all vertices in a finite region A are open (or closed) 
fails to decay exponentially in the volume of A, and as a consequence, the random field 
neither dominates nor is stochastically dominated by any nontrivial Bernoulli model. 

The geometry of level heights of a random field forms an important object of study 
both from the theoretical and the applied side. For example, it relates to the presence of 
hills and valleys on a rough surface, or to the random location of potential barriers in a 
doped semi-conductor. To fix the ideas we consider a random field X = (X(x),x G Z rf ) 
with values X(x) £ S C R which are not necessarily discrete. It is often interesting 
to divide S into two parts S\ and So and to define a new discrete random field Y 
via Y(x) = I{x(x)eSi}- For S = R one typically considers Si = [£, oo) for some level 
I E R. In this way we obtain a coarse-grained description of a system of continuous 
spins. One question is to which extent one can reconstruct the complete image from 
this information. We consider here a different question: what is the geometry of the 
random set {x 6 Zi d : Y(x) = 1}? This set is called the excursion or exceedance set 
when it corresponds, as in the example above, to the set on which the original random 
field exceeds a given level. For a recent review of this subject we refer to 0. 

We now give four examples of equilibrium systems with continuous spins where one 
can show (the absence of) percolation of an excursion (exceedance) set. Here we only 
state the results. Some hints on the proofs will be given later in Section || via Theorem 



8.1 . Details can be found in the paper by Bricmont, Lebowitz and Maes |pq ]. 



Example 5.11 Consider a general model of real- valued spins (cr(x), x *E Zi d ) with fer- 
romagnetic nearest-neighbor interaction. The formal Hamiltonian is given by 

H(a) = - £ a(x)a(y) . (21) 

The reference (or single-spin) measure A ^ 5q on R is assumed to be even and to decay 
fast enough at ±oo so that the model is well defined. Then, for any Gibbs measure 
/i relative to ( ^1~1) with ^(sgn(er(0))) > 0, there will be percolation of all sites x with 
a(x) > 0. Such Gibbs measures always exist at sufficiently low temperatures when 
d > 2. 
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Example 5.12 Consider again a spin system (<r(x), x G Z rf ), where now the Hamilto- 
nian has the 'massless' form 

x~y 

with cr(x) G R or Z and ip an even convex function. The single-spin measure A is either 
Lebesgue measure on R or counting measure on Z. The case a(x) G Z and vp(t) = \t\ 
corresponds to the so-called solid-on-solid (SOS) model of a d-dimensional surface in 
Zi d+1 ; the choice <j(x) G R and ^(i) = i 2 gives the harmonic crystal. Let \i be a Gibbs 
measure which is obtained as infinite volume limit of finite volume Gibbs distributions 
with zero boundary condition. (In the continuous-spin case, such Gibbs measures exist 
for any temperature when d > 3 and ip(t) = at 2 + 4>(t), where a > and cj) is convex 
p3| .) Then, for any I < 0, there is percolation of the sites x G Z rf with o~{x) > . 



Example 5.13 Consider next a model of two-component spins cr(x) G R 2 , x G Z , 
x cos <y^£) r x sin 4>x)i with formal Hamiltonian 

and some rotation-invariant and suitably decaying reference measure A on R 2 . Then, 
for any Gibbs measure \i with //(cos ^o) > 0, there is percolation of the sites x G Z d 
with cos^ > 0. Such Gibbs measures exist at low temperatures if d > 3. 



Example 5.14 Consider again the massless harmonic crystal of Example |5.12 above 
(with Tp(t) = t 2 ) in d = 3 dimensions. There exists a value £ c < oo so that for all t > t c 
there is no percolation of sites x G Zi d with ct(ie) > i. 

Finally, we give an example of a strongly correlated system, sharing some properties 



with the harmonic crystal of Example 5.14, where at present there is no proof of a 



percolation transition. The model is one of the simplest examples of an interacting 
particle system. What makes the problem difficult is that the random field is not 
Markov (not even Gibbsian) and not explicitly described in terms of a family of local 
conditional distributions. 

Example 5.15 The voter model is a stochastic dynamics in which individuals (voters) 
sitting at the vertices of a graph update their position (yes/no) by randomly selecting a 
neighboring vertex and adopting its position, see Liggett [|160| for an introduction. Using 
spin language and putting ourselves on Z 3 , the time evolution of this voter model is 
specified by giving the rate c(x, a) for a spin flip at the site x when the spin configuration 
is a G {+1,-1} 



3 



c(x, a) = ^ ( X ~ a ( x ) a (y)) 



There is a one-parameter family of extremal invariant measures \x v each obtained asymp- 
totically (in time) from taking the Bernoulli measure ip p with density p as initial con- 
dition. These stationary states \x v are strongly correlating. The spin-spin correlations 
decay as the inverse 1/r of the spin-distance r on Z 3 . It is an open question whether 
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for p sufficiently close to 1 the plus spins percolate, and whether for sufficiently small p 
there is no percolation. Simulations by Lebowitz and Saleur [157] indicate that there is 
indeed a non-trival percolation transition with critical value p c ~ 0.16. 

The same problem may be considered for d > 4. For d = 1, 2, however, the problem 
is not interesting because in these cases \i v is known to put mass p on the "all +1" 
configuration and mass 1 — p on the "all —1" configuration. Alternatively, one may 
consider the same model with Z 3 replaced by T^; Theorem 5.9 can then be applied to 
show that the plus spins do percolate for p > 



5.4 The number of infinite clusters 

Once infinite clusters have been shown to exist with positive probability in some per- 
colation model, the next natural question is: How many infinite clusters can exist si- 
multaneously? For Bernoulli site or bond percolation on Z rf , Aizenman, Kesten and 
Newman [l^] obtained the following, now classical uniqueness result: with probability 
1, there exists at most one infinite cluster. Simpler proofs were found later by Gandolfi, 
Grimmett and Russo [89] and by Burton and Keane ]^2|]. The argument of Burton and 



Keane is not only the shortest (and, arguably, the most elegant) so far. Also, it requires 
much weaker assumptions on the percolation model, namely: translation invariance and 
the finite-energy condition below, which is a strong way of stating that all local config- 
urations are really possible; its significance for percolation theory had been discovered 



before by Newman and Schulman [183] 



Definition 5.16 A probability measure ti on {0,1} , with C a countable set, is said to 
have finite energy if, for every finite region Ac£, 

H(X = 7] on A | X = £ off A) > 

for all 7] G {0, 1} A and [i-a. e. £ G {0, 1} AC . 



Theorem 5.17 (The Burton Keane uniqueness theorem) Let fi be a probability 
measure on {0, l} zd which is translation invariant and has finite energy. Then, fi-a.s., 
there exists at most one infinite open cluster. 

Sketch proof: Without loss of generality we can assume that /i is ergodic with respect 
to translations. For, one can easily show that the measures in the ergodic decomposition 
of ii admit the same conditional probabilities, and thus inherit the finite-energy property. 
Since the number N of infinite clusters is obviously invariant under translations, it then 
follows that N is almost surely equal to some constant k G {0, 1, . . . , oo}. In fact, 
k G {0,1, oo}. Otherwise, with positive probability each of the k clusters would meet 
a sufficiently large cube A; by the finite-energy property, this would imply that with 
positive probability all these clusters are connected within A, so that in fact k = 1, in 
contradiction to the hypothesis. (This part of the argument goes back to |183| ].) 

We thus only need to exclude the case k = oo. In this case, fi(N > 3) = 1, and the 
finite-energy property implies again that (jl{A x ) = 5 > 0, where A x is the event that x is 
a triple point, in that there exist three disjoint infinite open paths with starting point 
x. By the (norm-) ergodic theorem, for any sufficiently large cubic box A we have 

^(lA]- 1 ]T / Ai > 5/2) >l/ 2 . (22) 
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On the other hand, for geometrical reasons (which are intuitively obvious but need some 
work when made precise), there cannot be more triple points in A than points in the 
boundary dA of A. Indeed, each of the three paths leaving a triple point meets <9A, 
which gives three boundary points associated to each triple point in A. If one identifies 
these boundary points successively for one triple point after the other one sees that, at 
each step, at least one of the boundary points must be different from those obtained 
before. Hence, 

\A\~ l Y,lA x <\A\- 1 \dA\<5/2 

xeA 



when A is large enough. Inserting this into ( |22| ) we arrive at the contradiction /x(0) > 
1/2, and the theorem is proved. For more details we refer to the original paper |^| . □ 

We stress that the last argument relies essentially on the amenability property of 7i d 



discussed in Section |k2|. The finite-energy condition is also indispensable: In another 
paper [ 43 1 , Burton and Keane construct, for any k S {2, 3, . . . , oo}, translation invariant 
percolation models on Z 2 for which finite energy fails and which have exactly k infinite 
open clusters. For example, we have k = oo in Example 6.1. Fortunately, the finite- 
energy condition holds in most of the dependent percolation models which show up in 
stochastic-geometric studies of Gibbs measures. 

The situation becomes radically different when Z d is replaced by the non-amenable 
tree T^. Instead of having a unique infinite cluster, supercritical percolation models on 
Td tend to produce infinitely many infinite clusters. It is not hard to verify that this is 
indeed the case for supercritical Bernoulli site or bond percolation (except in the trivial 
case when the retention probability p is 1), and a corresponding result for automorphism 
invariant percolation on can be found in |118j ]. On more general nonamenable graph 
structures, the uniqueness of the infinite cluster property can fail in more interesting 



ways than on trees; see e.g. Grimmett and Newman [111 | and Haggstrom and Peres 

Q. 

Let us next consider the particular case of (possibly dependent) site percolation on 
Z 2 . We know from Theorem [5.17 that under fairly general assumptions there is almost 



surely at most one infinite open cluster. Under the same asumptions there is almost 
surely at most one infinite closed cluster (i.e., at most one infinite connected component 
of closed vertices). In fact, the proof of Theorem |5. 17 even shows that almost surely 



there is at most one infinite closed *-cluster. (Here, a closed *-cluster is a maximal set 
C of closed sites which is *-connected, in that any two x,y G C are connected by a 
*-path in C; *-paths were introduced in the proof of Theorem |5.3| . Any closed cluster 
is part of some closed ^-cluster.) But perhaps an infinite open cluster and an infinite 
closed *-cluster can coexist? Theorem |5,18 below asserts that under reasonably general 



circumstances this cannot happen. Under slightly different conditions (replacing the 
finite-energy assumption by separate ergodicity under translations in the two coordinate 
directions), it was proved by Gandolfi, Keane and Russo f90fl . 



Theorem 5.18 Let \i be an automorphism invariant and ergodic probability measure 
on {0, 1} Z with finite energy and positive correlations. Then 

/j,(3 infinite open cluster, 3 infinite closed *-cluster) = . 

Note that automorphism invariance in the Z 2 -case means that, in addition to translation 
invariance, [i is also invariant under reflection in and exchange of coordinate axes. Under 
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the conditions of the theorem, we have in fact some information on the geometric shape 
of infinite clusters: If an infinite open cluster exists and thus all closed *-clusters are 
finite, each finite box of Z 2 is surrounded by an open circuit, and all these circuits are 
part of the (necessarily unique) infinite open cluster. Hence the infinite open cluster is 
a sea, in the sense that all "islands" (i.e., the *-clusters of its complement) are finite. 
Similarly, if a closed *-cluster in Z 2 exists, it is necessarily a sea (and in particular 
unique) . 

The corresponding result is false in higher dimensions. To see this, consider Bernoulli 
site percolation on Z 3 . The critical value p c for this model is strictly less than 1/2 (see 
Campanino and Russo f47|), whence for p = 1/2 there exist almost surely both an 
infinite open cluster and an infinite closed cluster. 



The proof of Theorem 5.15 below is based on a geometric argument of Yu Zhang 



who gave a new proof of Harris' |128[ classical result that the critical value p c for bond 
percolation on Z 2 is at least 1/2. (Recall that this bound is actually sharp.) Zhang's 



proof appeared first in [108] and was exploited later in other contexts in [117, 120 1. 

Proof of Theorem 5.18: Let A be the event that there exists an infinite open cluster, 
let B be the event that there exists an infinite closed *-cluster, and assume by contra- 
diction that fj,(A fl B) > 0. Then, by ergodicity, fi(A n B) = 1. (This is the only use of 
ergodicity we make, and ergodicity could clearly be replaced by tail triviality or some 
other mixing condition.) Next we pick n so large that 

fj,(A n ) > 1 - 10 -3 and fJ,(B n ) > 1 - 10~ 3 , 

where A n (resp. B n ) is the event that some infinite open cluster (resp. some infinite 
closed *-cluster) intersects A n = [— n,n] 2 n Z 2 . Let A 1 ^ (resp. A%, A^ and A%) be the 
event that some vertex in the left (resp. right, top and bottom) side of the square-shaped 
vertex set A n \ A n _i belongs to some infinite open path which contains no other vertex 
of A n , and define B^ 1 , B^ and B^ analogously. Then 

A n = A^ U A^ U A^ U A^ ■ 

Since all four events in the right hand side are increasing and [i has positive correlations, 

fi(A n ) = /i(^u^u^uAf) 

< 1 - M-^nM^n )M-4n VMn ) , 

where -< indicates the complement of a set (for typographical reasons). By the auto- 
morphism invariance of /x, A^, A^, A^ and A^ all have the same ix-probability, so 
that 

^A L n ) < (1 - ^(An)) 1 " 

and therefore 

M(^) = /i(^) > 1 - (1 - M^n)) 1/4 = 1 " 10" 3/4 > 0.82 . (23) 
In the same way, we get 

H(BZ) = fM(B*) > 0.82 . (24) 
Now define the event D = A% n A% D B% n B^. From @ and (Hf) we obtain 

fi(D) > 1 - 4(1 - 0.82) = 0.28 > . 
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When D occurs, both the left side and the right side of A n are intersected by some 



infinite open cluster. By Theorem 5.17, these infinite open clusters are identical and 
separate their (common) complement into (at least) two pieces, preventing the infinite 
closed *-clusters intersecting the top and bottom sides of A n from reaching each other 
(see the picture on p. 196 of |108|]). Consequently, there exist two infinite closed *- 



clusters, in contradiction to Theorem |5.17 . □ 



Theorem 5.18 admits some variants. First, the assumption of ergodicity can be avoided 
if the assumption of positive correlations is strenghtened to the condition that fi is 
monotone in the sense of Definition [4.9|. (This is because monotonicity is preserved 



under ergodic decomposition, so that Theorem [4.11| implies positive correlations for 
each ergodic component.) Moreover, as the preceding proof shows, ergodicity is only 
needed to show that infinite clusters exist with probability either or 1, and translation 
invariance and finite energy are only used for the uniqueness of infinite clusters. We also 
need only the invariance under lattice rotations rather than all reflections, and closed 
*-clusters can be replaced by closed clusters. We may thus state the following result. 

Proposition 5.19 There exists no probability measure fi on {0, l} 7,2 which has positive 
correlations, is invariant under lattice rotations and the interchange of the states 1 
("open") and ("closed"), and satisfies 

n( 3 a unique infinite open cluster) = 1 . 

This proposition will be applied to the ferromagnetic Ising model in Section |B,2| , 
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6 Random-cluster representations 



In the previous section we saw a number of dependent percolation models. Here we shall 
focus on a particular class of such models, namely the (Fortuin-Kasteleyn) random- 
cluster model (and some of its relatives), which has turned out to be of great value 
in analyzing the phase transition behavior of Ising and Potts models. An alternative 
source for much of the material in the present section is Haggstrom [|119|j. In Sections 



6.1 and 6.2, we introduce the random-cluster model and discuss its relation to Ising 



and Potts models. This relation is then applied in Section 6.3 to prove Theorems 3.1 



and 3.2. Despite the fact that Theorems 3.1 and |3.2| concern infinite systems, these 



applications only require defining finite volume random-cluster measures. However, it 
may be interesting in its own right to study infinite volume random-cluster measures 
on graphs such as Z d ; this is done in Section 6A. In Section 6^, we describe how the 
random-cluster representation of Ising and Potts models can be used to construct highly 
efficient Monte Carlo simulation algorithms. Finally, in Section we discuss a variant 
of the random-cluster model which is applicable to the Widom-Rowlinson model rather 
than to Ising and Potts models. 



6.1 Random-cluster and Potts models 

The random-cluster model, also known as the Fortuin-Kasteleyn (FK) model after its 
inventors j8l]l , is a two-parameter family of dependent bond percolation models 

living on a finite graph. Let G = be a finite graph with vertex set C and edge 

set B. For a bond configuration n E {0, 1} S , we write k{rj) for the number of connected 
components (including isolated vertices) in the subgraph of G containing all vertices 
but only the open edges (i.e. those e E B for which n(e) = 1). 

Definition 6.1 The random-cluster measure (j)^ for G with parameters p G [0, 1] 
and q > is the probability measure on {0, 1} which to each rj G {0, 1} B assigns 
probability 




where is a normalizing constant. 

Note that taking q = 1 yields the Bernoulli bond percolation measure <p p defined at the 
end of Section |5.1| . All other choices of q give rise to dependencies between edges (as 
long as p is not or 1, and G is not a tree). 

Taking q G {2, 3, . . .} yields a model which is intimately related to the g-state Potts 
model, in a way which we will explain now. Let fip q be the Gibbs measure for the q- 
state Potts model on G at inverse temperature 0, i.e. /Vg is the measure on {1, ... , q} c 
which to each a G {1, . . . , q} c assigns probability 

where again Z^ q is a normalizing constant. 

For q G {2, 3, . . .} and p G [0, 1], let P p q be the probability measure on {1, ... , q} c x 
{0, 1} S corresponding to picking a random element of {1, . . . , q}^ x {0, 1} B according 
to the following two-step procedure. 
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1. Assign each vertex a spin value chosen from {1, . . . , q} according to uniform dis- 
tribution, assign each edge value 1 or with respective probabilities p and 1 — p, 
and do this independently for all vertices and edges. 

2. Condition on the event that no two vertices with different spins have an edge with 
value 1 connecting them. 

In other words, Pp„ is the measure which to each element (a, rj] of {1, . . . , q} c x {0, 1} B 
assigns probability proportional to 

n (^(i-p) 1 ^!^)-^))^}) • 

e=(xy)dB 

Here, (xy) denotes the edge linking x and y. The measure Pp q was introduced by 
Swendsen and Wang [21S] and made more explicit by Edwards and Sokal [73], and is 
therefore called the Edwards-Sokal measure. The following theorem states that the 
edge marginal of P^ q is a random-cluster measure, and the vertex marginal is a Gibbs 
measure for the Potts model, meaning that P^ q is a coupling of fi9 and 4>p, q - 

Theorem 6.2 Let P^' q eTtex and P^ ed9e be the probability measures obtained by pro- 
jecting Pp q on {1, ... , q} c and {0, 1} B , respectively. Then 

r>G,vertex _ ,,G /nr\ 

"p,q — Vfia \ Z0 ) 

with (3 = \ log(l — p), and 

P^ ed9e = & • (26) 

Proof: The proof is just a matter of summing out the marginals. Letting Z be the 
normalizing constant in Pp q , fixing a G {1, . . . , g}^, and summing over all i] G {0, 1} B 
we find 

P G,verte X[a) = £ p G^ v) 
rje{0,l} B 

= \ E II ^(l-p^aW^MeH} 

??G{0,1} B e=(xy)eB 

= — Y[ (1 - pY^i^^iy)} 
e={xy)(zB 

= \ ex P ( " 2 £ E 1 ) 

since Z must be equal to Z^ q by normalization. This proves (|25|). To verify ( p6[ ) we 
proceed similarly, fixing r\ G {0,1}^ and summing over a G {1, . . . ,q}^. Note that, 
given rj, there are exactly q k ^> spin configurations a that are allowed, in that any two 
neighboring vertices x ~ y with 77 ((xy)) = 1 have the same spin. We get 

^f^) = E 

o-e{i,..., 1 2} £ 

G 
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again by normalization. □ 

The Edwards-Sokal coupling of ji G and 0rL is the key to using the random-cluster 
model in analyzing the Potts model. The following two results are each other's dual, 
and are immediate consequences of Theorem |6.2| and the definition of Pp q - 

Corollary 6.3 Let p = 1 — e~ 2 @ , and suppose we pick a random spin configuration 
X G {1, . . . , q} C as follows: 



1. Pick a random edge configuration Y G {0, 1} B according to the random- cluster 
p,q 



measure 4>9 a . 



2. For each connected component C of Y , pick a spin at random (uniformly) from 
{1, . . . ,q}, assign this spin to every vertex of C, and do this independently for 
different connected components. 

Then X is distributed according to the Gibbs measure f/^ . 

Corollary 6.4 Let p = 1 — e~ 2/3 , and suppose we pick a random edge configuration 
Y G {0, 1} B as follows: 

1. Pick a random spin configuration X G {1, . . . ,q} C according to the Gibbs measure 
,.G 

2. Given X, assign each edge e = (xy) independently value 1 with probability 

j p ifX(x)=X(y) 
\ if X(x) ? X(y) , 

and value otherwise. 



Then Y is distributed according to the random- cluster measure A ' ' 



As a warm-up for the phase transition considerations in Section 3.2 , we give the following 



result as a typical application of the random-cluster representation. 

Corollary 6.5 If we pick a random spin configuration X G {1, . . . ,q} C according to 
the Gibbs measure fi9 , then for i G {1, . . . , q} and two vertices x, y G C, the two events 
{X(x) = i) and {X(y) = i} are positively correlated, i.e. 

li% q {X{x) = i,X(y) =i)> n% q (X(x) = i)4, q {X{y) = i) . 

Proof: The measure p& is invariant under permutation of the spin set {1, . . . , q}, so 
that 

^% q (X(x) =i)= fi% q (X(y) =*) = -• 
We therefore need to show that 

fi% q (X(x)=i,X(y)=i) >1. 

We may now think of X as being obtained as in Corollary |6.3| by first picking an 
edge configuration Y G {0, 1} B according to the random-cluster measure 4> G q and then 
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assigning i.i.d. uniform spins to the connected components. Given Y, the conditional 
probability that X{x) = X(y) = i is 1/q if u and v are in the same connected component 
of Y, and 1/q 2 if they are in different connected components. Hence, for some a £ [0, 1], 

H% q {X{x) = i,X(y) =i) = a - + (l- a )—>—. 

□ 

An easy modification of the above proof shows that if G is connected and j3 > 0, then 
the correlation between Iix(x)=i\ an d I{x(y)=i} ls m fact strictly positive. 

Note that the relation between the random-cluster model and the Potts model de- 
pends crucially on the fact that all spins in {1, . . . , q} are a priori equivalent. This is 
no longer the case when a nonzero external field is present in the Ising model. Several 
attempts to find useful random-cluster representations of the Ising model with external 
field have been made, but progress has been limited. Perhaps the recent duplication 
idea of Chayes, Machta and Redner |5(]] represents a breakthrough on this problem. 



6.2 Infinite-volume limits 

In this subsection we will exploit some stochastic monotonicity properties of random- 
cluster distributions on finite subgraphs of Z d . This will give us the existence of certain 
limiting random-cluster distributions, and also the existence of certain limiting Gibbs 
measures for the Potts model. 

The basic observation is stated in the lemma below which follows directly from 
definitions. 

Lemma 6.6 Consider the random- cluster model with parameters p and q on a finite 
graph G with edge set B. For any edge e = (xy) G B, and any configuration rj € 
{0, l} B \{ e \ we have that 

c f p if x and y are connected via open edges in rj 

M (e is open | rj) = j (27) 



For q > 1, Lemma p.6| means in particular that the conditional probability in fl27| ) is 
increasing in rj (and also in p). This allows us to use Holley's Theorem and the FKG 
inequality to prove the following very useful result. We write (f>p for Bernoulli bond 
percolation on G with parameter p. 

Corollary 6.7 For a finite graph G and the random- cluster measure cj)p q with p € [0, 1] 
and q > 1, we have 

(a) (f)p q is monotone, and therefore it has positive correlations, 

(b) <f>% 9 lv $ , 

(c) C^^' 



p+(i-p)g 

Furthermore, for < p\ < P2 < 1 and q > 1, we have 
(d) 
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Proof: The monotonicity in (a) is just the observation that the conditional probability 
in ( |27| ) is increasing in p and in rj. Positive correlations then follows from Theorem |4.11 . 
Next, note that (^) implies that 

' < 4>pq( e is open 1 77) < p (28) 



p+ (1 -p)q 



for all r] as in Lemma |6.6| . Theorem [L^ in conjunction with the second (resp. first) 
inequality in (p8|) implies (b) (resp. (a)). Finally, (d) is just another application of (d) 
and Theorem |4.8| . □ 

Consider now the integer lattice Z rf (for definiteness and simplicity) with its usual graph 
structure. We associate with any finite region A C Z d two specific random-cluster 
distributions which correspond to two different choices of boundary condition. The 
latter will we distinguished by a parameter 6 £ {0, 1}. Let B be the set of all nearest- 
neighbor bonds in Z d , B\ the set of all edges of B that are contained in A, and B\ the 
set of edges with at least one endpoint in A. (The difference B\ \ £>° thus consists of all 
edges leading from a point of A to a point of A c .) We then let <p b p A be the probability 
measure on {0, 1} S in which each r\ E {0, l} e is assigned probability proportional to 

i)(e) I k(ji,K) 



1 eesi J 



where k(rj, A) is the number of all 77-open clusters meeting A. We call (j)p q \ the random- 
cluster distribution in A with parameters p and q and boundary condition 6. In the 
case 6 = 0, k(-,A) is simply the number of all clusters that are contained in A; this 
corresponds to forgetting all sites in A c and is therefore referred to as the free boundary 
condition. On the other hand, suppose that A has no holes, in the sense that A c has 
no finite connected components; since we can always assume without loss of generality 
that A is connected, we call such a A simply connected. Then, in the case 6 = 1, all sites 
of A c may be thought of as being firmly wired together, whence this is called the wired 
boundary condition. 

Suppose now that A C A are two finite regions in Z d . Then $ A is obtained from 
^pq A °y conditioning on the event {n = b on B\ \ B\} which is increasing for 6 = 1 



and decreasing for 6 = 0. Hence, if q > 1 then Corollary (a) implies that 

4>p,q,A -<V 4>p,q,A and 4>l,q,A 4>l,q,A Wnen A C A , (29) 

in complete analogy to (|I4|). Moreover, we obtain the following counterpart of Proposi- 
tion |4.14| on the existence of infinite- volume limits. We write A f Z d for the limit along 
some (any) increasing sequence of finite simply connected subsets of Z d , converging to 
Z d in the usual way. 

Lemma 6.8 For p £ [0, 1] and q > 1, the limiting measures 

4> h pA = lim $ A , 6 G {0, 1} , 
A|Z d 

exist and are translation invariant. 

This convergence result has consequences for the convergence of Gibbs distributions for 
the Potts model, as we will show next. Let q S {2, 3, . . .}, and for i £ {1, . . . , q} and 
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any finite region A in Z d let f/p A denote the Gibbs distribution in A for the Potts 
model at inverse temperature (3 with boundary condition r] = i on A c . For i = 0, let 
M/3 ? a be the corresponding Gibbs distribution with free boundary condition, which is 
defined by letting C = A in @, i.e., by ignoring all sites outside A; we think of /J^ ^ as 

a probability measure on the full configuration space {1, . . . , q} zd by using an arbitrary 
extension. 



Still for i = 0, Theorem 6.2 shows that u9> „ A and otf „ a admit an Edwards-Sokal 



Vf3, q ,A ana <Pp, q ,A 

coupling (on A) when p = 1 — e~ 2 @. A similar Edwards-Sokal coupling is possible for 
i € {1, ...,«?} when A is simply connected. Indeed, let P % v A be the probability measure 
on {1, . . . , q} c x {0, 1} B corresponding to picking a random site-and-bond configuration 
according to the following procedure. 

1. Assign to each vertex of A c value i, and to all edges of B \ B\ value 1. 

2. Assign to each vertex in A a spin value chosen from {1, . . . , q} according to uniform 
distribution, assign to each edge in B\ value 1 or with respective probabilities p 
and 1 — p, and do this independently for all vertices and edges. 

3. Condition on the event that no two vertices with different spins have an edge with 
value 1 connecting them. 

It is now a simple modification of the proof of Theorem [T^ to check that the vertex and 
edge marginals of P^ q ^\ are /i^ ?A and 0p j?) A) respectively. (Note that by the simple 
connectedness of A there is always a unique component containing A c .) Analogues of 



Corollaries 3.2 and p73 follow easily. This leads us to the following result extending 



Proposition |4.14j to the Potts model. 



Proposition 6.9 For any i £ {0, 1, . . . , q}, the limiting probability measure 

V%q = Mftg.A 

on {1, . . . , q} z<i exists and is a translation invariant Gibbs measure for the q-state Potts 
model on 7i d at inverse temperature (3. 

Proof: In view of the general facts reported in Section 2.6, the limits are Gibbs measures 
whenever they exist. We thus need to show that ^ 9 a(/) converges as A j Z d , for any 
local observable /. For definiteness, we do this for i £ {1, ...,g}; the case i = is 
completely similar. 

Fix an / as above, and let A C Z d be the finite region on which / depends. As 
shown above, for a simply connected A we may think of a {1, ... , q} z -valued random 
element X with distribution /A ^ as arising by first picking an edge configuration 
Y G {0, 1} A according to 0^ A (with p = 1 — e~ 2/3 ) and then assigning random spins to 
the connected components, forcing spin i to the (unique) infinite cluster. For x, y G A, 
we write {x «-> y} for the event that x and y are in the same connected component in 
Y, and {x «-> oo} for the event that x is in an infinite cluster. Clearly, the conditional 
distribution of / given Y depends only on the indicator functions (I{ x ^ y }) x ,y€A and 
(^{i«oo})ieA, since the conditional distribution of X on A is uniform over all elements 
of {1, ... , q} A such that firstly X(x) = X(y) whenever x *-* y, and secondly X(x) = i 
whenever x *-* oo. Hence, the desired convergence of M3qa(/) follows if we can show 
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that the joint distribution of ( I{ x <- >y})x,yeA and (I{ x ^oo})x&A converges as n — > oo. This, 
however, follows from Lemma ^8| upon noting that (I{ x ^y}) x ,yeA and (I{ x +->oo})xeA are 
increasing functions. □ 

6.3 Phase transition in the Potts model 



As promised, this subsection is devoted to proving Theorems |3.l| and p.2| , using random- 



cluster arguments. The original source for the material in this subsection is Aizenman, 



Chayes, Chayes and Newman P]; see also [119] for a slightly different presentation. 



We consider the Potts model on Z , d > 2. All the arguments to be used here, 
except those showing that the critical inverse temperature j3 c is strictly between and 
oo, go through on arbitrary infinite graphs; we stick to the Z d case for definiteness 
and simplicity of notation. We consider the limiting Gibbs measures fi l g obtained in 
Proposition |6.9| . For i G {1, . . . , q}, these play a role similar to that of the "plus" and 
"minus" measures fi^ an d fig for the Ising model. In fact, we have the following result 
which extends Theorem 4.15| to the Potts model and also gives a characterization of 



phase transition in terms of percolation in the random-cluster model. 

Theorem 6.10 Let f3 > and p = 1 — e _2/3 . For any x G Z d and any i G {1, . . . , q], 

the following statements are equivalent. 

(i) There is a unique Gibbs measure for the q-state Potts model on Z d at inverse 

temperature j3. 

(ii) 4 >q (X(x) =i) = l/q. 
(hi) <t>l A (x <-> oo) = 0. 

As we will see in a moment, it is the percolation criterion (iii) which is most convenient 
to apply. In this context we note that 

4>l, q {x «-> oo) = mf <j)l qA (x ^ A c ) = hm ^^{x «-► A c ) , (30) 

where {x <-> A c } stands for the event that there exists an open path from x to some 
site in A c . This follows from (^) and the fact that {x <-> A c } decreases to {x «-> oo} 
as A t Z d . 

The usefulness of the percolation criterion is demonstrated by the next result which 
extends the scenario for Bernoulli percolation to the random-cluster model. Together 
with Theorem 5.1C , this gives Theorem |3.2| with (3 C = | log(l — p c ). 



Proposition 6.11 For the random- cluster model on Z d , d > 2, and any fixed q > 1, 
there exists a percolation threshold p c G (0, 1) (depending on d and q) such that 



<t>l, q (x <-» oo) 



= forp<p c , 
> for p > p c . 



Proof: The statement of the proposition consists of the following three parts: 

(i) <f>p t q(x *-* oo) = for p sufficiently small, 

(ii) (ftp^gix <-> oo) > for p sufficiently close to 1, and 
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We first prove (i). Suppose p < p c (Zi d , bond), the critical value for Bernoulli bond 
percolation on Z rf . For e > 0, we can then pick A large enough so that 

<p p (0 «-► A c ) < e. 

By Corollary |T7| (b), we have that the projection of 4> p on {0, stochastically dom- 
inates the projection of ^ A on {0, 1} s a for any A D A, so that 

#,„a(0 ^ oo) < ^ g>A (0 <-> A c ) < e 
for any A D A. Since £ was arbitrary, we find 

<^ a(0 <-> oo) = 



which in conjunction with (30) implies (i). 

Next, (ii) can be established by a similar argument: Let p be such that p* = 
p/[p + (1 — p)q] > p c (Z d ,bond). Corollary (c) then shows that 4>p q ^ (f>p* for 
every A, so that 

lim >i g ,A(° ^ OO) > 0, 

ATZ d y 

proving (ii). 

To check (iii) we note that Corollary (d) implies that, for any A, 

0p 1)9 ,A 0p 2 ,g,A whenever p x < p 2 . (31) 
This proves (iii) and thereby the proposition. □ 



Before the proof of Theorem 3.10 we need another definition and a couple of lemmas. 
For a finite box A in Z d and a spin configuration £ G {1, . . . , let 

A| = {x G dA : f (a?) = i} 

for i = 1, . . . ,q. We now define the random- cluster distribution <pp q a f or ^ with bound- 
ary condition £, as the probability measure on {0, 1} k a which to each r/ G {0, 1} b a 
assigns probability proportional to 



('/) 



where kt (rj) is the number of connected components in r\ that do not intersect dA, and 
,D(£, 77) is the event that there is no open path in r\ connecting any two vertices in 

and yl| for any i ^ j. 

Lemma 6.12 Let p = 1 — e -2 ^ 3 , let K be a finite region in Z d ; and fix some boundary 
condition £ E {l,...,g} 9A . Suppose that we pick a random spin configuration X G 
{1, . . . , q} A as follows. 



1. Pick Y G {0, 1} b a according to 



q ,A- 



48 



2. For each i G {l,...,q} and each connected component C in n intersecting A^, 
assign spin i to every vertex in C . 

3. For all other connected components C in Y, pick a spin at random (uniformly) 
from {1, . . . , q}, assign this spin to every vertex of C, and do this independently 
for different connected components. 

Then X is distributed according to the Gibbs distribution /jl^ a for the Potts model on 
A with boundary condition £. 



Proof: This is a straightforward generalization of the proofs of Theorem 6.2 and Corol- 
lary |J. □ 



Lemma 6.13 With notation as above, we have, for any £ G {1 
projection of A on {0, 1} b a stochastically dominates 4^ pq \- 

Proof: Just write down single-edge conditional distributions for <ffi D 
Lemma B.6I) and invoke Theorem 



dA 



that the 



and 



[as m 

□ 



We are finally ready for the proof of Theorem 6.10 



Proof of Theorem S.10: We begin with the implication (i) =>- (ii). If there is a unique 
Gibbs measure for the (/-state Potts model on Z d at inverse temperature j3, then we have 
in particular that 



4,1 



But since by symmetry /j^ q (X(x) = i) = q {X{x) = j) for any i,j G {1, . . . , q}, we 
must have [A (X(x) = i) = 1/q, and (ii) is established. 

Next we turn to the implication (ii) (hi) . By the Edwards-Sokal coupling of edge 
and site processes introduced before Proposition |6.9|, we have 



4, q (x(p) = i) 



hm tf p K {X(Q) 
ATZ d 







q q 



■ l im >?U,A(0 

A-[Z d F 



A c ) 



(32) 



Together with (|3C|), the result follows. 

Most of the work is needed for the implication (hi) (i). Roughly speaking, the 
absence of percolation in the random-cluster model implies that every finite region is cut 
off from infinity by a set of closed edges. Thus, independently of what happens macro- 
scopically, the local spins feel as if they are in a system with free boundary condition. 
This makes a phase transition impossible. 

To make this intuition precise we let \x be an arbitrary Gibbs measure for the Potts 
model at inverse temperature (5. We will show that fj, = fjp , the limiting measure with 
free boundary condition obtained in Proposition 6.£. We fix any local observable / and 
some e > 0. We then can find a finite box A C Z d such that 



< e for all finite T D A . 



In view of ( |30| ) we can also choose a finite box A D A satisfying 4>p, q ,A( x *~* 
for all x G A, and thus 



(33) 

A c ) < e/AI 



(A <-» A c ) < e 
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Here, {A «-> A c } is the event that there exists an open path from A to A c . 

Consider the complementary event C = {A *-* A c } c . For any edge configuration 
77 £ C, A is cut off from A c by a set of closed edges. Indeed, let T be the union of all 
77-open clusters meeting A. Then 

(a) A C T C A, and 

(b) 77(e) = for all edges from T to T c , i.e., 77 = on /3f \ B®. 

Since these properties are stable under finite unions, there exists a largest set r(?7) 
satisfying (a) and (b). The maximality implies that, for each fixed region T, the event 
{77 : r(?7) = r} only depends on the status of the edges off i3 r . 

Coming to the core of the argument, we fix any boundary spin configuration £ £ 
{1, . . . , q} 9A and consider the Edwards-Sokal coupling P = P^ A of /i| A and <fr p A 

'''' " 'z d 



introduced in Lemma |6. 12 . We write X for the random spin configuration in {1, . . . , q}' 



and Y for the random edge configuration in {0, 1} B . Then P(f o X) = p^ A (/), and 
P(Y £ C) = <4, A (C) > < ?jA (C) > 1 - e 



by Lemma 3.13 and the choice of A. Assuming without loss of generality that ||/|| < 1, 



we can therefore conclude that 

P(foX\Y£C)-^ q Jf)\<2e. (34) 

However, the conditional expectation on the left is an average of the conditional ex- 
pectations P(f o X I r(y) = r) with A C r C A, and these in turn are averages of 
conditional expectations of the form 

P(f o X I Y = r) off £f, Y = on /3f \ B r ) 

which, by construction of P, are equal to Mfl g r(/)- Together with ( |33[ ) and (p^), we 
conclude that 



< 3e . 



Taking the /i-average over £ and letting e — > we finally get fj,(f) = /J^ q (f), and the 
proof is complete. □ 

6.4 Infinite volume random-cluster measures 

The random-cluster arguments used in the previous subsections for studying infinite 
volume Ising and Potts models only required defining finite volume random-cluster dis- 
tributions, although we have seen already the limiting random-cluster measures 4^>,q- 
(Their existence was convenient in the formulation of Theorem 6.10| , but not really 



needed for the arguments.) Recent years have nevertheless witnessed a rapid develop- 
ment of a theory for infinite volume random-cluster measures, defined in the DLR spirit. 
Here we shall discuss the basics of such a theory. A similar theory of infinite volume 
Edwards-Sokal measures for joint spin and edge distributions was recently developed in 

Let G be an infinite (locally finite) graph with vertex set C and edge set B. Fix 
p £ [0, 1] and q > 0, and let B C B be a finite simply connected region; since the random- 
cluster model lives on edges rather than on vertices, we let "region" refer to edge sets 
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rather than vertex sets in this subsection. Let V{B) = {x G C : Be £ B incident to x}. 
For an edge configuration £ G {0, 1} B ° , define the random- cluster distribution <jj^& in B 
as the probability measure on {0, l} e in which each r\ G {0, l} e is assigned probability 
proportional to 

hn=t off 5} { II P" (e) (l -P) 1- ^ j , (35) 

VeeB ) 

where k(r], B) is the number of connected components of n which intersect V{B). This is 
a generalization of the random-cluster distributions cf) b p A defined in Section ^3^, which 
are recovered by taking B = B\ and £ = 6, b G {0, 1}. It is easy to see that the random- 
cluster distributions are consistent in the sense that conditioning on a configuration in 
some B' C B yields the corresponding random-cluster distribution in B \ B'. 

Definition 6.14 A probability measure 4> on {0, 1} B is said to be a random- cluster 
measure with parameters p and q if its conditional probabilities satisfy 

^10= in B | £ off B)=<p^(rj) 

for all finite simply connected B C B, <p-almost all £ and all r\ such that rj = £ off B. 

This is the direct analogue of the definition of a Gibbs measure in the random-cluster 
setting. There is, however, also another possibility which differs from the preceding one 
for graphs in which the complements of finite regions are not connected. The idea is to 
connect all infinite clusters at infinity. This corresponds to a one-point compactification 
of G. Accordingly, we shall use a prefix 'C which stands for "compactified" . Thus, we 
define a C -random- cluster distribution <pp& as in (p5|), except that k(n,B) is replaced 

by k(rj,B), defined as the number of all finite connected components of n intersecting 
V{B). 

Definition 6.15 A probability measure (f> on {0, 1} B is said to be a C-random- cluster 
measure for p and q if its conditional probabilities satisfy 

for all finite simply connected B C B, ^-almost all £ and all r\ such that rj = £ off B. 
The study of random-cluster measures in the case G = Z d was initiated by Grimmett 



109(|, and about simultaneously by Pfister and Vande Velde |191| and Borgs and Chayes 



3; see also Seppalainen [211 1 for some even more recent developments. The C- variant 



(Definition |6.15| ) was introduced in the regular tree case by Haggstrom [114, 115], and 



further studied in a general graph context by Jonasson |13S| ]. In the following, we will 
try to convince the reader that random-cluster measures of both types are of interest, 
and also discuss their relation to each other. 

We shall concentrate mainly on the case q > 1. The reason for this is that it is only 
for q > 1 that the conditional probability in ( p7| ) is increasing in r/, which allows the 
use of the stochastic domination and correlation results in Section || (Theorems 4^ and 



4 . 1 1| ) . For q < 1, these tools are not available, and for this reason the random-cluster 



model with q < 1 is much less understood than the q > 1 case, although in Grimmett 



109[| , Haggstrom [112] and Seppalainen [211] one can find at least some results in the 



q < 1 regime of the parameter space. 
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We write <^'° for <j>^^ with £ = 0, and <pp£ for (j)p ^ with £ = 1. We can omit 
the hat when G = Z d , because there is always exactly one infinite cluster regardless 
of the configuration on B (this is related to Proposition 6.19| below). On the other 



hand, the two different ways of counting clusters with wired boundary condition are not 
equivalent for all graph structures; a simple counterexample is G = in which the 
wired boundary condition gives rise to several infinite clusters. 

Still in the context of general infinite graphs, we write B j B for the limit along 
some (any) sequence of finite simply connected regions increasing to B in the usual way. 



In complete analogy to (p2£|) and Lemma 6.8 we then obtain the following monotonicity 
and convergence result. 

Lemma 6.16 For p £ [0, 1], q > 1, and any two finite bond sets B\ C B2, we have 

(i) ^p,q'° —V 4>p?q > so that the limit (\P p q = lim exists; and 

(ii) d)^ 1 ' 1 >~t> <i>n 2 a l , so that the limit &\„= lim 6^1} exists. 

Now let (j> be any random-cluster measure of either type, compactified or not, with the 
given parameters p and q. Further application of Theorem [L^ (or Corollary |6.7| ) implies 
that 



This is analogous to the sandwiching relation ( |i5|) for the Ising model 



Furthermore, the arguments of Section 6.3 go through to show that the q-state Potts 



model on G at inverse temperature (3 has a unique Gibbs measure if and only if the 
4>p ^-probability of having an infinite cluster is 0, where as usual p = 1 — e~ 2/3 . 



For Definitions |6.14| and p,15| to be of interest, we have to establish at least the 
existence of random-cluster measures of the two types. The following theorem tells us 
that at least for q > 1, such measures do exist. (The existence problem for q < 1 
remains open in the setting of general graphs, although existence has been established 
for Z d and T d ; see [p.09|| and respectively.) 

Theorem 6.17 For p G [0,1] and q > 1, we have that (i) (fip q is a random- cluster 
measure, and (ii) <j>p „ is a C-random- cluster measure. 

For the proof we use the following lemma which characterizes random-cluster measures 
in terms of single-edge conditional probabilities. For e = (xy) £ B and £ G {0, l} s ^ e ^, 
we write as usual {x <-> y} for the event that there exists an open path in £ from x to y. 
We also write {x < — ► y} for the event that there either exists an open path in £ from 

x to y, or x and y are both in infinite clusters of £. We think of this C-connectivity 

c 

notion x * — > y as allowing paths between x and y to go "via infinity" . 

Lemma 6.18 Fix p £ [0,1] and q > 0, and let cj) be a probability measure on {0,1} . 
Then <j) is a random- cluster measure for p and q if and only if for each e = {xy) £ B 
and (p-a.e. £ £ {0, l} 6 ^} we have 

f p if x <-> y in £ 

^ (eiS ° Pen|e) = \ pT^q otherwise, ^ 



Similarly, (f> is a C-random- cluster measure for p and q if and only if (3t) holds with 



x < — > y instead of x <-> y. 
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Proof: We consider only the first statement, as the C-case is completely similar. For 



the "only if" part we only need to note that the right-hand side of (36) is equal to 
(ftp^q'^(e is open ). Passing to the "if" part, we may restrict ourselves to the case of 
p £ (0, 1). Assume that <p satisfies (|36| ) for each e = (xy) £ B and 0-a.e. £ £ {0, l} B ^ e h 
Let B C 6 be some finite edge set. We need to show that the conditional distribution 
of (j> given the configuration £ on B c equals (f)p ™ for 4>-a.e. £. For this, it suffices 
to check that for any two configurations 77, 7/ £ {0, l} e which agree with £ on B c we 
have 

MIO ^ {Xl e&B P^ e Xl-p) X -^}q k ^ B ) 



Hv' 1 e) {n e6S ^ /(e) a - p)i-*(z)} q Hv',B) 

with /c defined as in (|35|). If 77 and 7/ differ only at a single edge e, then k(rj,B) 



(37) 



k(rj',B) = k(r],e) — k(rj',e), whence ( |37| ) is immediate from (36). In the general case, 
we interpolate 77 and 7/ by a sequence of configurations which successively differ in at 
most one edge, and use a telescoping argument. □ 



Proof of Theorem 6.17 : We prove (ii) only, as (i) follows from a similar argument 
and is also better known, see e.g. Borgs and Chayes Fix e = (xy) £ B. By Lemma 

5,18| , it is sufficient to establish (36) with the C-connectivity relation < — > in place of 



the standard connectivity relation 

Let Bi, f?2, • • ■ be an increasing sequence of finite edge sets containing e and con- 
verging to B in the usual sense. We write, with slight abuse of notation, £(-E>i) for the 
restriction of £ to Bi\{e}. We recall from the martingale convergence theorem that 

<f>p^ q (e is open | £) = lim 4>p q (e is open | £,(Bj)) (38) 

for e^-a.e. £ £ {0,l} B ^ {e} - 

We suppose first that x < — > y fails in £. Then at least one of the vertices x and 
y is in a finite cluster of £, and consequently there is some m (depending on £) such 

that -i(x < — > y) can be verified by just looking at ^(B m ). (This is a consequence of the 
special concept of C-connectivity.) For any n > j > m we then have 



( e is °P en I = p+ (l-p) q 



so that by the definition of tpl „ we get 



p,q 

^ g (e is open | £(Bj)) - 



p+ (1 -p)^ 



by letting n — > 00. Then we let j — > 00 and use (pq) to deduce the C-version of ( p6| ) in 



the case £ ^ {x > y}. 

c _____ 

We go on to the case £ £ {x < — > y}. In analogy to p8]), we have 

lim <g (x^y|,(S J -)) = l 

for „-a.e. £ £ {x <— > 7/}. For such £ and any e > 0, we can thus find an m (depending 
on £) such that 

4> l p , q {x^y\(i{Bi))>l-e (39) 
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for any j > m. Next we use the definition of <j)p q - For any n = 1, 2, . . ., let Y and Y n be 
{0, l} B -valued random edge configurations with distributions 4>^ q and ^g' 1 satisfying 
Y n y Y; this is possible by Lemma 3.16. We write P n for the probability measure 
underlying this coupling. By the same lemma and the order relation Y n y Y, we have 

lim P n (Y n (B j )^Y(B j )) = 0. (40) 

c c 
Since 7 n G {x < — > y} whenever Y G {x < — > y}, we can write 

< P n ({x ^ y in y n , y n (^) = A {x <-^-> y in Y, Y(Bj) = £{Bj 

where A denotes symmetric difference. Since {x < — ► y} is the decreasing limit of the 
local events {x «-> y in A} U {x «-> A c , y <-> A c } as A | £, an analogue of ( |30| ) together 
with (40) shows that the last expression tends to zero as n — ► oo. It follows that 

Hm ^(x ^ y | £(£,-)) = ^ y \ £(!*,•)) 

which is at least 1 — e by (^9|). But since <^JJ' 1 (e is open | £') = p for each n and all 
£ G {x i > y}, we get 

p — e < lim <j)p^ l {e is open | £(Bj)) < p . 

Hence, 



P - e < 4>^ q (e is open | £(Bj)) < p , 



and since e was arbitrary we can use ( |38|) to deduce the C-version of (|36|) in the case 
£ G {x < ► y}. □ 

Let us now briefly address the issue of whether the two types of random-cluster measures 
are any different. The following result says that very often they are the same. 

Proposition 6.19 Let cf> be a probability measure on {0, 1} B with 

(j>(3 at most one infinite open cluster) = 1 . 

Then, for any p £ [0, 1] and q > 0, (j) is a random- cluster measure for p and q if and 
only if it is a C -random- cluster measure for p and q. 

This means that whenever "uniqueness of the infinite cluster" can be verified, the two 
types of random-cluster measures coincide. An example is obtained if we consider trans- 
lation invariant random-cluster measures for Z d , since the Burton-Keane uniqueness 
theorem (Theorem 5.17) applies in this situation. For Z d , the measures (j)® and <$pq 
are translation invariant, by Lemma 6.8 . On the other hand, uniqueness of the infinite 
cluster typically fails on trees, leading to very different behavior for the two types of 
random-cluster measures; see [114, 115] for a discussion. 
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Proof of Proposition |6.19| : For p = or 1 the result is trivial, so we may assume 
that p £ (0, 1). The conditional probabilities in ( |36| ) and its C-counterpart differ only 
on the event 

A xy = {x < — ► y} \ {x <-> y} . 

Hence if <j) is a random-cluster measure but not a C-random-cluster measure (or vice 
versa), then A xy has to have positive ^-probability for some edge e = (xy) £ B. But 
then the event A xy C\{e is closed} has positive (^-probability, and since this event implies 
the existence of at least two infinite clusters, we are done. □ 

Much of the study of infinite volume random-cluster measures that has been done so 
far concerns the issue of uniqueness (or non-uniqueness) of random-cluster measures. 
A discussion of this issue would, however, lead us too far, so instead we advise the 



reader to consult Grimmett [109], Haggstrom [114] and Jonasson [139] to find out what 



is known and what is conjectured in this field. 

6.5 An application to percolation in the Ising model 

In Theorem |5.10| we have seen that the probability of percolation of plus spins in the 
Ising model is an increasing function of the external field. A much harder question is 
to determine monotonicity properties of percolation probabilities as (3 (rather than h) 
is varied. An interesting open problem is to decide whether for G = Z d , d > 2, the 
probability 

p^t{x oo) 

is increasing in (3. Here we write fit for the plus phase in the Ising model at inverse 

temperature (3 with external field h = 0, and {x < — > oo} is the event that there exists 
an infinite path of plus spins starting at x. At first sight, one might be seduced into 
thinking that this would be a consequence of the connection between Ising and random- 
cluster models, and the stochastic monotonicity of random-cluster measures as p varies; 
see (|3l|). However, such a conclusion is unwarranted. For example, in the coupling 
of Theorem |6.2| the existence of an open path between x and y in the random-cluster 
representation is a sufficient but not necessary condition for x and y to be in the same 
spin cluster. In fact, Haggstrom |11£| ] showed, by means of a simple counterexample 
and in response to a question of Cammarota [jig] , that the probability that x and y are 
in the same spin cluster need not be increasing in /3, and similarly for the expected size 
of the spin cluster containing x. 

However, when the underlying graph is a tree, monotonicity in (3 of the probability 
of plus percolation can be established: 

Theorem 6.20 For the Ising model on the regular tree T^, d > 2, with a distinguished 
vertex x, the percolation probability n~t(x oo) is increasing in (3. 

An interesting aspect of this result is that its proof, unlike those of the monotonicity 
results mentioned earlier in this section, is not based on stochastic domination between 
the probability measures in question. In fact, stochastic domination fails, i.e. it is not 



always the case (in the setting of Theorem 3.20 ) that 

/4x ^ 4* ( 41 ) 



55 



when fa < fa. An easy way to see this is as follows. Just as in Theorem |3.1| , let 
P c be the critical inverse temperature for non-uniqueness of the Gibbs measure. (It is 
straightforward to show, using either the random-cluster approach or the methods in 
Section g, that fa > for C = T d .) Pick fa < fa in (0, fa). By Theorem |4.15j we then 



have : £{y) = +1) = M*g 2 (£ : C(y) = +1) = 1/2 for every vertex y. If now ([TJ) 



was true we would have /it = /lt 2 by Proposition 4.12. This, however, is impossible 



because the two measures have different conditional distributions on finite regions. 



Theorem 3.20 can be proved using the exact calculations for the Ising model on T^, 



which can be found in e.g. Spitzer J215| | and Georgii [96|. Here we present a simpler 
proof which does not require any exact calculation, but which exploits random-cluster 
methods. 



Proof of Theorem |6.20 : As usual we write £ and B for the vertex and edge sets of 
Td- Since 

for any f3 in analogy to (30), it suffices to show that for fa < fa and any A, we have 

This we will do by constructing a coupling P of two { — 1, +1} -valued random objects 
X\ and X2 with respective distributions \it x A and fit A and the property that if x 
00 in Xi, then the same thing happens in X%. 

Recall the Edwards-Sokal coupling of spin and edge configurations described ahead 



of Proposition 6.9. In the present case of the tree T^, this construction requires 
the C-version of counting clusters, which corresponds to making the complement of 
A connected. Therefore we will work with the C-random-cluster distributions. Let 
pi = 1 — e -2 ^ 1 and P2 = 1 — e -2 ^ 2 , and let B = B\ C B be the set of edges incident to at 
least one vertex in A. We first let Y\ and Y2 be two {0, l}^-valued random edge config- 
urations distributed according to the random-cluster measures 4>p^\ and 4>p^\, and such 

that P(Yi ^ Y2) = 1; this is possible by the ^^"analogue of (|3X|) . X\ and X2 can now 
be obtained by assigning spins to the connected components of Y\ and Y2 in the usual 
way; these spin assignments are coupled as follows. First we must assign spin +1 to all 
infinite clusters in Y\ and Y%. Then we let (Z(y)) y& \ be i.i.d. random variables taking 
values +1 and —1 with probability 1/2 each, and assign to each finite cluster C of Y\ 
and Y2 the value Z(y), where y is the (unique) vertex of C that minimizes the distance 
to x. This defined X\ and Xi- A moment's thought reveals that the set of vertices that 
can be reached from x via spins in X\ is almost surely contained in the corresponding 
set for X2. Hence ( fi"2"| ) is established, and we are done. □ 

Note that this proof did not use any property of T^ except for the tree structure, so 
Theorem 3.20| can immediately be extended to the setting of arbitrary trees. 



6.6 Cluster algorithms for computer simulation 

An issue of great importance in statistical mechanics which we have not touched upon 
so far is the ability to perform computer simulations of large Gibbs systems. Many 
(most?) questions about phase transition behavior etc. can with current knowledge only 
be answered partially (or not at all) using rigorous mathematical arguments. Computer 
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simulations are then important for supporting (or rejecting) heuristic arguments, or (in 
case not even a good heuristic can be found) to provide ideas for what a good conjecture 
might be. This topic is somewhat beside the main issue of our survey, but since random- 
cluster representations have played a key role in simulation algorithms for more than 
a decade we feel that it is appropriate to describe some of these algorithms. In fact, 
it was the need of efficient simulation which, in the late 1980's, sparked the revival of 
the random-cluster model (Swendsen and Wang [ |219| ]) which up to then had raised only 
little interest since its introduction by Fortuin and Kasteleyn in the early 1970's. 

Consider for instance the Ising model with free boundary condition on a large cubic 
region A C Z d . Direct sampling from the Gibbs distribution [ihfi,K with free boundary 
condition is not feasible, due to the huge cardinality of the state space fi, and the 
(related) intractability of computing the normalizing constant for the Gibbs measure. 
The most widely used way to handle this problem is the Markov chain Monte Carlo 
method, which dates back to the 1953 paper by Metropolis et al. [178]. The idea is 
to define an ergodic Markov chain having as unique stationary distribution the target 
distribution Hh,p,A- Starting the chain in an arbitrary state and running the chain 
for long enough will then produce an output with a distribution close to the target 
distribution. An example of such a chain is the single-site heat bath algorithm, whose 
evolution is as follows. At each integer time, a vertex x £ A is chosen at random, and 
the spin at x is replaced by a new value according to the conditional distribution (under 
M/i,/3,a) of the spin at x given the spins at its neighbors. It is immediate that fJ>h,/3,A is 
stationary for this chain, and ergodicity of the chain follows from elementary Markov 
chain theory upon checking that it is aperiodic and irreducible. The problem with this 
approach is that the time taken to come close to equilibrium may be very long. For 
example, let h = 0. Then, for (3 < f3 c (with (3 C defined as in Theorem 3.1), the time 
taken to come within a fixed small variational distance from the target distribution 
grows only like n log n in the size of the system (here n is the number of vertices in A) 
whereas in contrast the time grows (stretched) exponentially in the size of the system for 



(3 > f3 c ; see e -g- [169, 168]. This means that simulation using this heat bath algorithm is 



computationally feasible even for fairly large systems provided that (3 < (3 C , but not for 
(3 > 0c- What happens for P > (3 c is that if the chain starts in a configuration dominated 
by plus spins, then the plus spins continue to dominate for an astronomical amount of 
time, and similarly for starting configurations dominated by minus spins. The set of 
configurations where the fraction of plus spins is around 1/2 (rather than around the 
fractions predicted by the magnetization in the infinite- volume Gibbs measures /it and 
fip) has small probability and thus can be seen as a "bottleneck" in the state space, 
slowing down the convergence rate. 

A way to tackle the exceedingly slow convergence rate in the phase coexistence 
regime is to use the heat bath algorithm for the corresponding random-cluster model 
rather than for the Ising model itself, and only in the end go over to the Ising model 
by the random mapping described in Corollary |6.3| . This has the disadvantage that 
the calculation of single-site (or, rather, single-edge) conditional probabilities become 
computationally more complicated due to the possible dependence on edges arbitrarily 
far away (see Lemma |6.6[ ). This disadvantage, however, seems to be by far outweighed 
by the fact that the convergence rate of the Markov chain (for (3 > (3 C ) appears to be 
very much faster than for the heat bath applied directly to the spin variables. The 
reason for this phenomenon is that the random-cluster representation "doesn't see any 
difference" between the plus state and the minus state. This approach can of course 
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be used also for the q > 3 Potts model, and is due to Sweeny [218]. Later, Propp and 
Wilson |20[1(| built on this approach by coupling several such Markov chains (i.e. running 
them in parallel) in an ingenious way, producing an algorithm which runs for a random 
amount of time (determined by the algorithm itself) and then outputs a state which 
has exactly the target distribution. The running time of this algorithm turns out (from 
experiments) to be moderate except for the case of large q and (5 close to the critical 
value. The Propp-Wilson approach, known as exact or perfect simulation, has received 
a vast amount of attention among statisticians during the last few years (see e.g. the 
annotated bibliography |225| ]) and we believe that it has interesting potential also in 
physics. 

There is, however, another Markov chain which appears to converge even faster 
than those of Sweeny, Propp and Wilson. We are talking about the Swendsen-Wang 
[|219H algorithm, which runs as follows for Ising and Potts models on a graph with 
vertex set C and edge set £>: Starting with a spin configuration Xq G {1, . . . ,q} c , a 
bond configuration Yq 6 {0, 1} B is chosen according to the random mapping defined in 
Corollary pM Then another spin configuration X\ is produced from Yq by assigning 
random spins to the connected components, i.e. by the random mapping of Corollary 



673[ This procedure is then iterated, producing a new edge configuration Y\ and a new 
spin configuration X2, etc. By combining the two corollaries, we see that if Xq is chosen 
according to the target distribution, then the same holds for X±, and consequently for 
X2, A3, .... In other words, the target distribution is stationary for the chain {Xk}^ = Q, 
and by the (easily verified) ergodicity of the chain we have a valid Markov chain Monte 
Carlo algorithm. Although it is not exact in the sense of the Propp-Wilson algorithm, 
it appears to converge much faster, thus in practice allowing simulation of systems that 
are orders of magnitude larger. Heuristically, the reason for this faster convergence is 
that large chunks of spins may flip simultaneously, allowing the chain to tunnel through 
any bottlenecks in the target distribution. However, rigorous upper and lower bounds 
on the time taken to come close to equilibrium are to a large extent lacking, although Li 
and Sokal |159| have provided a lower bound demonstrating the phenomenon of "critical 
slowing down" as [5 approaches (5 C - 

The Swendsen-Wang algorithm has, since its introduction in 1987, become the stan- 
dard approach to simulating Ising and Potts models. Interesting variants and modifi- 
cations of this algorithm have been developed by Wolff [226] and Machta et al. [165]; 
the last paper is an interesting attempt at combining the original approach of Swendsen 
and Wang with ideas from so called invasion percolation (see [6jf]) to get an algorithm 
specifically aimed at sampling from a Gibbs distribution at the critical inverse tem- 
perature j3 c , i.e. where the use of other algorithms have proved to be most difficult. 
Generalizations of the Swendsen-Wang algorithm for various models other than Ising 
and Potts models have also been obtained, see e.g. Campbell and Chayes [48], Chayes 
and Machta [58, 59|, and Haggstrom et al. [121]. 



6.7 Random-cluster representation of the Widom— Rowlinson model 

The random-cluster model can be seen as a perturbation of Bernoulli bond percolation, 
where the probability measure is changed in favour of configurations with many (for q > 
1) or few (for q < 1) connected components. A fairly natural question is what happens if 
we perturb Bernoulli site percolation in the same way. For lack of an established name, 
we call the resulting model the site-random-cluster model. Let G be a finite graph with 
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vertex set C and edge set B. For a site configuration r\ G {0, 1} C , we write k(r)) for the 
number of connected components in the subgraph of G obtained by deleting all vertices 
x with rj(x) = and their incident edges. 

Definition 6.21 The site-random-cluster measure ip^ for G with parameters p G 
[0, 1] and q > is the probability measure on {0, 1} C which to each i] G {0, 1}^ assigns 
probability 



Mv) 



where Z^ q is a normalizing constant. 

Analogously to the usual random-cluster model living on bonds, taking q = 1 gives 
the ordinary Bernoulli site percolation ip p , while other choices of q lead to dependence 
between vertices. 

Taking q = 2 is of particular interest because it leads to a representation of the 
Widom-Rowlinson model which is similar to (and in fact slightly simpler than) the 
usual random-cluster representation of the Ising model. Let //^ be the Gibbs measure 
for the Widom-Rowlinson model with activity A on G, i.e. /j,^ is the probability measure 
on {—1,0, +1} C which to each £ £ {—1,0, +1} C assigns probability proportional to 

n %*m*-i> n xiax)l ■ 

(xy)eB xeC 



The following analogues of Corollaries |6.3| and 6.4 are trivial to check 

A 
l+A 



Proposition 6.22 Let p = t^y, and suppose we pick a random spin configuration 



X G {-1,0,+1} £ as follows. 

1. Pick Y G {0, 1}^ according to ipp 2 ■ 

2. Set X(x) = for each x G C such that Y(x) = 0. 

3. For each open cluster C ofY, flip a fair coin to decide whether to give spin +1 
or —1 in X to all vertices of C . 

Then X is distributed according to the Widom-Rowlinson Gibbs measure 

Proposition 6.23 Let p = and suppose we pick a random spin configuration 

Y G {0, 1} £ as follows. 

1. Pick X G { — 1,0, +1}^ according to fi^. 

2. Set Y(x) = \X{x)\ for each x G C. 

Then Y is distributed according to the site-random- cluster measure V^2- 

We remark that for q G {3, 4, . . .}, these results extend in the obvious way to a connection 
between tp^ and the generalized Widom-Rowlinson model whith q types of particles 
rather than just 2 (and strict repulsion between all particles of different type). 

Many of the arguments applied to Ising and Potts models in Section |6.3| can now be 



applied to the Widom-Rowlinson model in a similar manner. To apply Theorem 4. 
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we need to calculate the conditional probability in the site-random-cluster model that a 
given vertex is open given the status of all other vertices. For x G L and r\ G {0, l}A{ a; } ) 
we get 

pq l- K {x,ri) 

is °P en I "n) = pq i-n( X , V ) + 1 _ p ( 43 ) 

where k(x, rj) is the number of open clusters of rj which intersect x's neighborhood 
{y G C : y ~ x}. If the degree of the vertices in G is bounded by N, say, then 
< k(x,t]) < N for any x G C and rj G {0, lj-AM. F or fixed g and any p* G (0, 1), 
we can thus apply Theorem 45 to show that ipp q stochastically dominates Vy f° r P 
sufficiently close to 1, and is dominated by ip p * for p small enough. The arguments of 
Section |6.3| leading to a proof of Theorem 3^1 , with the random-cluster model replaced 
by the site-random-cluster model, therefore go through to show Theorem 3.4. 

One thing that does not go through in this context, however, is the analogue of (|3l|). 
The reason for this is that, in contrast to (|27|), the conditional probability in ([43]) fails 
to be increasing in 77, so that Theorem [h^ is not applicable for comparison between 
site-random-cluster measures with different values of p. In fact, the analogue of (|3l|) 
for site-random-cluster measures sometimes fails, and moreover the occurrence of phase 
transition for the Widom-Rowlinson model on certain graphs fails to be increasing in 
A, as demonstrated by Brightwell, Haggstrom and Winkler [38]. 

Another consequence of the failure of the conditional probability in (|43| ) to be in- 
creasing is that the FKG inequality (Theorem [4.1 1 ) cannot be applied to tpp q - As a 
consequence, the proof of Theorem |6.10| cannot be adapted to the case of the multitype 
(q > 3) Widom-Rowlinson model. In fact, such a Widom-Rowlinson analogue of The- 



orem J640| is known to be false, as shown by Runnels and Lebowitz [207]; see also ]^7| 
and IPC 
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7 Uniqueness and exponential mixing 
from non-percolation 

In the previous section we saw examples where phase transition in one system was 
equivalent to the existence of infinite clusters in another, suitably defined, system. In 
this section we shall discuss various approaches where conclusions about the phase 
transition behavior can only be drawn from nonexistence (and not from existence) of 
infinite clusters. On the other hand, these approaches typically apply to a much wider 
range of models. We address two problems: the uniqueness of the Gibbs measure, and 
the decay of correlations for a given Gibbs measure. In fact, the general theme of 
this section can be stated as follows: To which extent can a given spin be influenced 
by a configuration far away? If such an influence disappears in the limit of infinite 
distance, it follows (depending on the setting) that either there is no long-range influence 
of boundary conditions at all (implying uniqueness of the Gibbs measure), or that a 
specific low temperature phase exhibits some mixing properties. In both cases, the 
decreasing influence comes from the absence of infinite clusters of suitable type which 
could transport a dependence between spins. So, both uniqueness and mixing will 
appear here as a consequence of non-percolation. 

In a first part, we will address the problem of uniqueness. In fact, we will encounter 
conditions which not only imply the uniqueness of the Gibbs measure, but also lead 
us into a regime where 'all good things' happen, i.e., where the unique Gibbs measure 
exhibits nice exponential mixing properties and the free energy depends analytically 
on all relevant parameters. (In general, the uniqueness of the Gibbs measure does not 
imply the absence of other critical phenomena, which might manifest themselves as 
singularities of the free energy or other thermodynamic quantities. For example, in 
Section ^ we will see that in the so-called Griffiths' regime of a disordered system there 
is a unique Gibbs measure, but the free energy is not analytic.) 

The 'nice regime' above is usually referred to as the high temperature, or weak 
coupling, low density, or also analytic regime, and is usually studied by high temperature 
cluster expansions. Dobrushin and Shlosman []7(], fn] developed a beautiful and general 
theory describing a regime of 'complete analyticity' by various equivalent properties. 
One of these ranks at the top of a hierarchy of mixing properties. While complete 
analyticity makes precise what actually the 'nice regime' is, and applies mainly to 



high temperatures or large external fields, it is not limited to this case only [75]. The 
relationships between this and related notions and also with dynamical properties have 
been studied in many papers. Although some of these have an explicit geometric flavor, 
we do not discuss them here because of limitations of space. We rather refer to the 
sources [jn], fn], 217, |168| , 17C] and also to the references following condition (|68|), 



In Section 7.3 we shall discuss an application of the percolation method to the low 
temperature regime, and see how percolation estimates for the covariance between two 
distant observables, combined with contour estimates, give rise to exponential mixing 
properties. 

7.1 Disagreement paths 

Let (£, ~) be an arbitrary locally finite graph, and suppose we are given a neighbor in- 
teraction U : S x S — > R and a self-potential V : S — > R. Consider the associated Gibbs 
distributions fxl ^ introduced in (f|). More generally, we could consider an arbitrary 
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Markov specification (Ga)a££ in the sense of Section 2^. Such specifications appear, 
in particular, if we have an interaction of finite range R, say on Z d , and draw edges 
between all sites of distance at most R. However, for definiteness and simplicity we 
stick to the setting described by the Hamiltonian (|l]). We will often consider the inverse 
temperature j3 as fixed and then simply write /i A instead of fxl ^. If A is a singleton, we 
use the shorthand x for {x}. 

We look for a condition implying that there is only one Gibbs measure ji for the 
Hamiltonian (Q), i.e., a unique probability measure on VL = satisfying 

/u( • | X = rj off A) = fj,\ for /i-almost all r] G fl . 

Since this property needs only to be checked for singletons A = {x} (cf. Theorem 1.33 
of it is sufficient to look for conditions on the single-spin Gibbs distributions p x 

with x G C. Intuitively, we want to express that fj,^(X(x) = a) depends only weakly on 
77 (which can be expected to hold for small 0). This dependence can be measured by 
the maximal variation 

Px = max Wf4-nt\\ x , (44) 

where 

|M|a = sup \u(A)\ (45) 

is the total variation norm on the sub-cr-algebra of events which depend only on the 
spins in A. We write p as a shorthand for the family (px)x£C- 

Given two configurations £, £' G a path in C will be called a path of disagreement 
(for £ and £') if £(#) 7^ £'(x) for all its vertices x. For each finite region A C C and any 
two configurations rj, rj' on A c we will construct a coupling P of p\ and f/^ describing 
the difference of these measures in terms of paths of disagreement running from the 
boundary dA into the interior of A. Intuitively, these paths of disagreement then show 
how deep inside the influence of the boundary conditions can still be felt. We write 

{A < — > dA} for the event in S A x S A that there exists a path of disagreement from 
some point of a set A C A to some point of dA. 

Although the coupling P to be constructed is not best suited for direct use, it has 
a useful special feature: its disagreement distribution is stochastically dominated by a 
Bernoulli measure. This will allow us to conclude that absence of percolation for the 
latter implies uniqueness of the Gibbs measure for the Hamiltonian (|l|). 

We write tp p for the Bernoulli measure on {0, 1} C with ip p (X(x) = 1) = p x for all 
x G C, and ip Pt A for the analogous product measure on {0, 1} A . As in Section g wc 
use the notation X(x) and X'(x) for the projections from f2 x 17 to S. The following 
theorem is due to van den Berg and Maes [^7]] . 

Theorem 7.1 For each finite A C C and each pair n, n' G f2 there exists a coupling 
P = Pa,t],t]' of fi\ and p\ having the following properties: 

(i) For each x G A, {X(x) / X'(x)} = {x «-^-> dA} P-a.s. 

(ii) For the distribution Pif of (I{(x(x)^X' (x)})xeA under P, we have Pif <x> "0p,A • 

(iii) For each A C A, 

K-^aIU <P(A^8A) <ij p (A~dA) . (46) 
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Proof: We construct a coupling (X, X') of Ma and Ma by the following algorithm. In a 
preparatory step we introduce an arbitrary linear ordering on A , set A = A, and define 
X(x) = r](x),X'(x) = rf(x) for x G A c . 

For fixing the main iteration step, suppose that (X, X') is already defined on the 
complement of a non-empty set A C A and is realized as a pair (£,£') off A, where 
(£,£') = (r],rf) off A. Conditional on the event that (X, X') = (£,£') off A, we consider 
the Gibbs distributions Ma and Ma obtained by conditioning /x A and fi\ on X = £ 
resp. £' off A, and we pick the smallest vertex x = x (£,£') G A for which there exists 
some vertex y G A c with y ~ x and £(y) / £'(y)- If such an x does not exist, we 
have Ma = Ma on .Fa by the Markov property, so that we can take the obvious optimal 
coupling for which X = X' on A, and we are done. If such an x does exist, we consider 

the single vertex distributions Ma a = Ma(^( x ) = ') an< ^ ^Ax = Ma (-^0*0 = ' ) on 
S. Conditionally on (X,X') = off A, we then let (X(x),X'(x)) be distributed 

according to an optimal coupling (as in Definition [4.3|) of Ma x and Ma E . The coupling 



(X, X') is then defined on the set x U A c , so that we can replace A by A \ x and repeat 
the preceding iteration step. 

It is clear that the algorithm above stops after finitely many iterations and gives us a 
coupling of fj!l and . Property (i) is evident from the construction, since disagreement 
at a vertex is only possible if a path of disagreement leads from this vertex to the 
boundary. For (ii), we note that the measures and ^ x are mixtures of the 

Gibbs distributions \x a x with suitable boundary conditions a, by the consistency of Gibbs 
distributions. Hence 

ii £ 5' n 

IIMa^ — Ma,xIU — Px ■ 

By construction, this means that in each iteration of the main step we have 

P(X(x) + X\x) | (X,X f ) = off A) < Px 

for x = x(£,£'), so that (ii) follows by induction. Finally, (iii) follows directly from (i) 
and (ii) because for each A C A 

H/4 - Ma II A < P(X(x) / X'{x) for some x G A) 
by the coupling inequality (|lO|). The proof is therefore complete. □ 

Although the algorithm in the proof above is quite explicit, it is not easy to deal with 
directly. In particular, it is not clear in which way the coupling depends on the chosen 
ordering, because the site x to be selected in each step depends on (£, £') and is therefore 
random. Nevertheless, if the Gibbs distributions are monotone (in the sense of Definition 



4.9), we get some extra properties. 



Remark: Suppose S is linearly ordered and the conditional distributions /i| ar e stochas- 



tically increasing in £. Then, if r/ X rj' , the coupling P of Theorem 7.1 can be chosen 
in such a way that, in addition to properties (i) to (iii), X < X' P— a.s. and, for each 
x G A, 

IK " A < P(x OA) <(\S\- 1) |K - rf|U ■ (47) 



This is because in each step of the algorithm proving Theorem |7.l| we can achieve that 



X(x) < X'(x), and for the second inequality in ( |47| ) it is sufficient to note that 
P(x «-^-» dX) = P(X(x) < X'{x)) < i P ( X ( x ) < a )~ P(X'{x) < a)} , 



a£S\{m} 
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where m is the maximal element of S. For details we refer to [27]. In particular, for 
\S\ = 2 we have equality in d47|). 

Let us apply this remark to the ferromagnetic Ising model with external field h = and 
any inverse temperature 0, with boundary conditions n = +1 and rf = —1 outside of 
some finite region A € £ . Then, by the spin flip symmetry and stochastic monotonicity, 

mJaW 35 )) = mJaW*) = l )~ ^,a( x ( x ) = l ) = II/4,a - m^,aIU 

and therefore, by (|47|), 



4 A {X{x)) = P{x ^ dk) . 

We emphasize that this relation is completely similar to what we obtained for the 
random-cluster representation, viz. 

for p = 1 — e~ 213 ; cf. equation (^). The coupling P, however, is less explicit, and 
the geometric event involves site percolation rather than bond percolation as for the 
random-cluster measure, but the exact correspondence between the magnetization for 
the spin system and the percolation probability of the geometric system is the same. 

Let us now turn to the main result of this subsection, the uniqueness theorem. Let 
(j,, fj,' be any two Gibbs measures for the Hamiltonian ([!]) at some inverse temperature 
0. Inequality (|46|) then shows that 

||a«-//||a< sup ||ma _ MaIIa < ip p (A++dA) 

77,77' £f2 

whenever A C A G £■ Letting A f C we find 

||m - m'IIa < V>p(A 00) 
which gives the following uniqueness result. 

Theorem 7.2 If vp p (3 an infinite open cluster ) = then the setQ{(3H) of Gibbs mea- 
sures for the Hamiltonian at inverse temperature (3 is a singleton. In particular, this 
holds if sup x p x < p c , the critical density for Bernoulli site percolation on (£, ~). 



A weaker version of Theorem [7.2| was obtained first in [24] using a product coupling 



instead of Theorem 7.1; see Proposition 7.10| below and also p8[ . In some cases, the 



simple product coupling nevertheless gives equivalent results; cf. the discussion in [27] 
For a large class of regular graphs such as Z rf , the assumption of Theorem 7.2 not 



only implies the uniqueness of the Gibbs measure but even yields certain exponential 
mixing properties. This can be seen almost immediately by combining inequality ([46|) 
with Theorem 15.61 on the exponential tail of the distribution of the cluster diameter in 



sub-critical Bernoulli percolation. We will use similar arguments in Section 9J2 in the 
context of random interactions. 



Let us discuss now some special cases. Clearly, the conditions of Theorem |7.2j hold 
when C = Z with the usual graph structure, since then p c = 1. This gives uniqueness 
of the Gibbs measure for one-dimensional nearest-neighbor systems. Next we consider 
the case C = Z d , d > 2. Recall the bound (|l~9|) for the percolation threshold p c when 
d = 2, and the large-dimensions asymptotics of p c in (pC|). 
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Example 7.3 The Ising ferromagnet. Let j3 > be any inverse temperature and h an 
external field. Then, for any x, we obtain from by a short computation 

= Kft* - A£ lA JU = [tanh(/?(/i + 2d)) - tanh(/3(/i - 2d))]/2 . 

Hence, the Gibbs measure is unique when h = and tanh(2d/3) < p c , or if |/t| > 2d is 
so large that 2d < p c cosh 2 (P(\h\ — 2d)), for example. 

Example 7.4 77ie hard-core lattice gas. Setting = 1, we see that p^ = A/(l + A) for 
any x, so that uniqueness of the Gibbs measure follows for A < p c /(l — Pc)- (This can 
also be obtained by using the product coupling mentioned above, cf. f2lj .) 



Example 7.5 77ie Widom-Rowlinson lattice gas. We take again /3 = 1 and set A+ = 
A_ = A. It turns out that the maximum in equation fl44|) is attained for the boundary 
conditions rj = and r/ equal to +1 and —1 on (at least) two different neighbors of x, 
whence p x = 2A/(1 + 2A) for any x. It follows that the Gibbs measure is unique when 
\< Pc /(2(l- Pc )). 



It is interesting to compare the uniqueness condition of Theorem |7.2| with the celebrated 
Dobrushin uniqueness condition, cf. |)(| and the original papers |35|, ^] . This condition 
reads 

supJ2 ™ a: * WVx-VxWx < !• ( 48 ) 
x y v=v off y 

The constraint "77 = rf off y" means that the configurations 77, 77' differ only at the vertex 
y. For systems with hard-core exclusion or in certain antiferromagnetic models it often 
happens that, for every y G dx, the maximum in (|48|) is actually the same as that in 
(f44[), see [p7fl . Dobrushin's uniqueness condition then takes the form sup^ l^xjpx < 1. 
For C = Zi d and = po independently of x, this means that po < l/(2d), while 
Theorem ^2| only requires po < Pc, an d it is known that p c > \j(2d — 1) for d > 1. 
However, if the constrained maximum in Dobrushin's condition is much smaller than 
the unconstrained maximum in (f44|), then Dobrushin's condition will be weaker than 
that of Theorem fT^. For example, for the Ising ferromagnet on Z rf with external field 



h = 0, Dobrushin's condition requires that 2dtanh/3 < 1 which, in view of (20), is 



less restrictive than the condition obtained in Example 7.3. Thus, roughly speaking 



Theorem 7.2 works best for "constrained" systems with strong repulsive interactions 
and low-dimensional lattices (or graphs with small |<9x|'s) for which reasonable lower 
bounds of the critical probability p c are available. Examples are the hard-core lattice 



gas and the Widom-Rowlinson lattice gas on Z 2 considered above. 



There is also another reason why Theorem 7.2 is useful. Namely, its condition of 
non-percolation is a global condition: the absence of percolation does not depend on 
the value of p x at any single site x. In particular, p x could be large or even be equal 
to 1 for all x's in an infinite subset (say, a periodic sublattice) of £; once the p^'s are 
sufficiently small on the complementary set, there is still no infinite open cluster. This 
can be applied to non-translation invariant interactions where, in general, it is impossible 
to obtain uniform small bounds on the p^'s (or on the strength of the interaction, as 
would be required by the Dobrushin condition or for some standard cluster-expansion 



argument). We will come back to this point in Section 7^ 
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7.2 Stochastic domination by random-cluster measures 

Recently, Alexander and Chayes jl4|] introduced a variant of the random-cluster tech- 
nique that applies to a substantially greater class of systems than those considered in 
Section ^. This approach involves a so-called graphical representation of the original 
system. The graphical representation is stochastically dominated by a random-cluster 
model, and absence of infinite clusters in this random-cluster model implies uniqueness 
of the Gibbs measure for the original spin system. The price to pay for the greater 
generality is that the implication goes only one way: percolation in the random-cluster 
model does not, in general, imply non-uniqueness of Gibbs measures. 

We assume that the state space S is a finite group with unit element 1; the inverse 
element of a E S is denoted by a -1 , so that a~ 1 a = aa^ 1 = 1. For simplicity we assume 
that the underlying graph is C = Zi d (although this will not really matter). We consider 
the Hamiltonian (|l|) for a pair potential U and with no self-energy, V = 0. By adding 
some constant to U (which does not change the relative Hamiltonian) we can arrange 
that U < 0. The basic assumption is that U is left-invariant, so that 

U(a,b) = u{a- l b) (49) 

for all a, b E S and the even function u = U(l,-) < 0. Note that this setting includes 
the (/-state Potts model for which S = Zi q and u = — 2Isq\. For any finite A C Z d we 
consider the Gibbs distribution 



V I \ ^W=V off A} 

= ^dKW exp 



V u((7(x)-V(y)) 



(xy)eB A 



at inverse temperature (3 with boundary condition rj E O. Here we write B\ for the 
set of all bonds b E B with at least one endpoint in A. The graphical representation 
of f/l A will be based on bond configurations oj E {0, 1} Ba . Each such oj will also be 
viewed as a subset of B\, and the bonds in oj will be called open. The key idea of this 
representation is taken from the classical high temperature expansion. For fixed (3 > 
and any a E S we introduce the difference 

R a = e - pu ^ -1>0. (50) 

With this notation we can write 

*w = %£f n (i+^.)-m 



{o-=?7 off A} \ ^ 1 [ 

7k ta —\ 1^ 11 K a(x)-^a(y) 



This shows that jij^ A is the first marginal distribution of a probability measure Pjj A on 
Q x {0, 1} Ba , namely 

d £ fi, w G {0, 1} Ba . The second marginal distribution of * s ecma l to 

^, A H = W'^»/^ A (/3,7 7 ), 
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where 

a=r\ off A (xy)£u 

is the "graphical weight" of any uj £ {0, 1} Ba . The probability measure 7^ A on {0, 1} Sa 
is called the graphical distribution or the grey measure (since it ignores the spins which 
are considered as colors). The graphical representation of \ip A thus obtained is analogu- 
ous to the random-cluster representation of the Potts model and can be summarized as 
follows. 

Lemma 7.6 In the set-up described above, the Gibbs distribution fjJl A can be derived 
from the graphical distribution 7^ A by means of the conditional probabilities 

b=(xy)£uj 

That is, 

/ i /3,A( fJ )= E T^aH^aH^) ■ 
^e{o,i} SA 

For the Potts interaction u = — 2/| } with state space S = Z q , the graphical represen- 
tation above is easily seen to coincide with the random-cluster representation studied 
in Section ^. One important feature is that the graphical weights factorize into cluster 
terms. Indeed, each bond configuration lo divides A into connected components called 
open clusters (which may possibly consist of isolated sites). The set of bonds belonging 
to an open cluster C is denoted by loq- Writing C(u>) for the set of all open clusters we 
then obtain that 

W £a(") = IT Wl A (C,u c ) (52) 

with 

WJU(C lUo )= E II fl,w-MW. 

a£S c : o=r) on CndA (xy)&u)c 

(We make the usual convention that the empty product is equal to 1; hence A (C, u>c) 
= \S\ if C is an isolated site.) Together with Lemma |7.6| , equation (^) shows that the 
spins belonging to disjoint open clusters are conditionally independent. In particular, 
we can simulate the spin system by first drawing a bond configuration u with weights 
(51) and then obtain in each open cluster a spin configuration according to Pjj A (a\uj). 

Suppose we knew that there is no percolation in the graphical representation, in 
the sense that max r? 7^ A (0 <-> d A) — > as A f C The conditional independence of 
spins in different open clusters would then suggest that there is only one Gibbs measure 
for the spin system. Unfortunately, this is not known (though weaker statements are 
established in [p^]). However, one can make a stochastic comparison of the graphical 
distributions with wired random-cluster distributions (Lemma [7.7| below), and the ab- 
sence of percolation in the dominating random-cluster distribution will then guarantee 
that the original system has a unique Gibbs measure. This will be achieved in The- 
orem 7.8 allowing to bound the dependence on boundary conditions in terms of the 



connectivity probability in a random-cluster model. 
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To this end we also need to consider Gibbs distributions \iL A with free boundary 

condition. These admit similar graphical representations 7^ A based on bond configura- 
tions inside A; that is, the bonds leading from A to A c are removed. In the following, 
the superscript / will refer to this case. 

The stochastic comparison with random-cluster distributions will be formulated us- 
ing 

R* = maxR a , R=j^^R a , P = -R*/(l + R*) , q = R*/R. (53) 



aeS " ' |^| ^ — ' 



Note that these quantities depend on (3 since the R a in ( |50| ) do. In the case of the 
r-state Potts model when u = —21^, we have R* = 1 — e~ 2 @ and q = r; that is, 
in this case the parameters p and q are nothing but the standard parameters of the 
random-cluster representation. For p and q as above we consider now the wired (resp. 
free) random-cluster distribution <fip q a ( res P- 4>^ q a) m as introduced in Section |6.2| . 

Lemma 7.7 For any A G £, (3 > andp, q as above, 7^ A <x> ^q.A an d Tr a — 23 ^p,q,A- 

Proof: We only prove the first statement since the second is similar and simpler. 
According to Section |6.1| , the weights of the random-cluster distribution </>p 9 a are pro- 
portional to 



1 — p, 

with \uj\ the number of open bonds and k(uj, A) the number of open clusters meeting 
A (where all clusters touching <9A are wired together into a single cluster). Up to a 
constant factor, the Radon-Nikodym density of 7^ A relative to ^ A is thus given by 

F(u) = Wl k {uj) I {R*) H (R*/R) k ^^ . 

Since <fip q \ has positive correlations, the lemma will therefore be proved once we have 
shown that F is a decreasing function of to. To this end we let uj ■< u' be such that 
lo' = lj U {b} for a bond b E B\ \ uj. 

We first consider the case when b = {xy) is not connected to <9A and joins two open 
clusters C x , C y G C{lo). For each open cluster C let W(C) = Wg A (C,ooc) be as in (|5^). 
Suppose we stipulate that the spin a(z) at any site z G C is equal to some a £ S. It 
is then easy to see that the remaining sum in the definition of W{C) does not depend 
on a and thus has the value M^(C) /| *S| . Prescribing the values of o~(x) and o~(y) in this 
way we thus find that 

w(c x uc y ub) = w(c x )w(c y )\s\~ 2 Yl R ^)-^(y), 

and therefore Wl A (u') = RWj} A (u). Since k(u)',A) = k(u),A) - 1 and |u/| = \u\ + 1, 
it follows that F{oj') = F{ui), proving the claim in the first case. If b links some cluster 
to the boundary which otherwise was separated from the boundary, then the argument 
above shows again that F{uj') = F(iv). 

Next we consider the case when b = {xy) closes a loop in to but is still not connected 
to the boundary. Since clearly 



-fto-M-Mv) - ma %Ra — R* 
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we find that < Wp A (uj)R*. On the other hand, in this case we have \u'\ = 

\u\ + 1 and k(u', A) = /c(u>, A), so that F{oj') <F(uj). As we are considering the wired 
random-cluster measure, this argument remains valid if b joins two clusters already 
attached to the boundary. □ 



We are now in a position to state the main result of Alexander and Chayes [14], an 
estimate on the dependence of Gibbs distributions on their boundary condition in terms 
of percolation in the wired random-cluster distribution. Recall the notation (45) for the 
total variation norm on the sub-<7-algebra of events in some A. 



Theorem 7.8 Consider the spin system with pair interaction at some inverse 

temperature (3 > 0, and let p, q be given by Then, for any A C A G £ and any 

pair of boundary conditions r/, rj G £1, 

1 1/4 a ~ /^'aIU - ^L,a( a «-» dA ) ■ 

Proof: (This proof is different from the one that appeared in Jl4|].) Let A be any event 
in J- & . From Lemma |7.6| we know that 



To control the 77-dependence of this probability we will proceed in analogy to the argu- 
ment for the implication (ii) =>- (iii) of Theorem |6.10[ If u> G {A <A dA} then equation 
( |52|) shows that the conditional distribution Pjj a (A\uj) does not depend on 77. So we 
need to control the //-dependence of 7^ A (A <A dA). This, however, does not seem pos- 
sible directly. So we will replace 7^ A by the //-independent ^ A by using a suitable 
coupling trick. 



By Lemma 7.7 and Strassen's theorem (Theorem ^fj) there exists a coupling (1", Y') 
of 7^ A and </>p gA such that Y <Y' almost surely. If Y' G {A <A <9A}, there exists a 
largest (random) set T = T(Y') such that 

(a) A C T C A, and 

(b) Y'(b) = for all bonds connecting T with T c . 



For Y' G {A <-> <9A} we set T = 0. Conditional on T, Lemma 7.7 and Strassen's theorem 
provide us further with a coupling (Yr, Yp) of 7^ r and <pp q -p such that Yr ^ Yj>. It is 
then easy to see that the pair of random variables (Y, Y') defined by 

(Y,Y')(b) = 



(Yr, Y r )(6) if b is contained in T, 
(Y,Y')(b) otherwise 



is still a coupling of 7^ A and (f>~ A such that Y ■< Y' almost surely. (Notice that also 

Y(b) = for all bonds from T to r c .) We denote the underlying probability measure by 
Q v . Now we can write 



4A A ) = Q\ p l^ A \ Y ) 

= Q^PlA(A\Y)I {r=9} )+Qv(Pl A (A\Y)I {rm ) . 
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The first term in the last sum is at most Q r? (r = 0) = 0* A (A <-> 9A). We claim that 
the second term does not depend on 77. Indeed, if V ^ then, by (|52|), A (^4|Y) = 

P^ r (A|lr) only depends on the restriction Yp of Y to the set of bonds inside V. The 
second term can thus be written explicitly as 

E <,,A( r = G) £ t£gH ^/,g(^) , 

G^0 winG 

which is obviously independent of 77. The theorem now follows immediately. □ 

To apply the theorem we consider the limiting random-cluster measure </>i „ with arbi- 
trary parameters p G]0, 1[ and g > 1 and wired boundary condition; recall from Section 



5.2 that this limiting measure exists. By Corollary 6.7(d), it makes sense to define the 
percolation threshold 

p c {q) = inf{p : 0* ? (O <-> 00) > 0}. 

We also consider the threshold w c (q) for exponential decay of connectivities, which is 
defined as the supremum of all p's for which 

0j ig (O <-» 0A) < c e - cd (°' 9A ) 

uniformly in A (or, at least, for A in a prescribed sequence increasing to C) with suitable 
constants c > and C < 00. It is evident that w c (q) < p c (q)', for large q it is known 
that w c (q) = p c (q) |75|. Theorem [F^ then gives us the following conditions for high- 



temperature behavior; compare with Theorem 7.2. 



Corollary 7.9 Whenever (3 is so small that p < p c {q), there is a unique Gibbs measure 
for the Hamiltonian H with pair interaction ffity- Furthermore, if in fact p < w c (q) 
then the spin system is exponentially weak-mixing in the sense that there are positive 
constants C < 00, c > such that for all A C A G £ and all boundary conditions 

iK,A-^; A iiA<ci5A| e -^ c ). 

In fact, Alexander and Chayes [14] go a bit further in their exploration of 'nice' high 
temperature behavior, showing that for p < p c (l) the unique Gibbs measure satisfies 
the condition of 'complete analyticity' (investigated in [jnfl, for example). 



7.3 Exponential mixing at low temperatures 

In the previous subsections we have seen how stochastic-geometric methods can be 
used to analyze the high temperature behavior of a spin system and, in particular, 
for establishing exponential decay of correlations. Here we want to demonstrate that 
similar percolation techniques can also be used in the low temperature regime in the 
presence of phase transition. We will present a method to show that, for a given phase 
/j, the covariance //(/; g) of any two local observables / and g decays exponentially fast 
with the distance between their dependence sets. (The problem of phase transition at 
low temperatures will be addressed in Section |{|) 

As a matter of fact, the problem of exponential decay of covariances (or truncated 
correlation functions) arises in many physical situations. Correlation functions are re- 
lated to interesting response functions or to fluctuations of specific order parameters. 
Exponential decay of covariances also provides estimates on higher order correlation 
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functions, eventually providing infinite differentiability of the free energy with respect 
to an external field [f7^| . Motivated by these interests, a variety of techniques have been 
developed. The most familiar approach are cluster expansions which apply equally well 
to both the high temperature (or low density) regime and the low temperature (or high 
density) regime. Although they often employ geometric concepts, it seems useful to 
combine them with ideas of percolation theory to make geometry more visible. An 
example of this is the method to be described below which is taken from a paper by 



Burton and Steif 1 37 1 , where it was used to show that certain Gibbs measures exhibit a 
powerful mixing property called 'quite weak Bernoulli with exponential rate'. 

We consider a spin system on an arbitrary graph (£, ~) with Hamiltonian (||). As 
before, the essential feature of this Hamiltonian is that only adjacent spins interact, so 
that the Gibbs distributions /x^ A in finite regions A have the Markov property. The 
inverse temperature (5 > does not play any role for the moment, so we set it equal to 
1 and drop it from our notation. 

Our starting point is the following estimate on the 77-dependence in terms of dis- 
agreement paths for two independent copies of /jlJ^. This result is a weak version (and, in 



fact, a forerunner |M[]) of Theorem 7T. It is a pleasant surprise that although developed 
with high temperature situations in mind, it also provides a useful alternative to some 
aspects of the standard low temperature expansions. 

Proposition 7.10 For any A C A E E and 77,7/ E Q, 

\K-^\\ A <nlx4(A^dA). 

Proof: For brevity let P = x fi\, and write X, X' for the two projections from 
$1 X f2 to O. Then for any A E .Fa we have 

»l(A) - 4(A) = P(X E A) - P(X' E A) . 

We decompose the probabilities on the right-hand side into the two contributions ac- 
cording to whether the event {A < — > <9A} occurs or not. In the latter case, there exists 
a random set T C A containing A such that X = X' on dT. (The union of all disagree- 
ment clusters in A meeting A is such a set.) Let T be the maximal random subset of 
A with this property. Then for each G the event {r = G} only depends on the config- 
uration outside G, and X = X' on dG. The Markov property therefore implies that, 
conditionally on {r = G} and (A^rc, X' Gc ), Xq and X' G are independent and identically 

distributed, and this shows that P(X 6 A, Af dK) = P(X' £ A, A <fc <9A). The 
proposition now follows immediately. □ 

What is gained with the disagreement estimate above? First, let us observe that this 
estimate provides bounds on covariances of local functions in terms of disagreement 
percolation. 

Corollary 7.11 Fix any A E £ and 77 E O. Let f and g be any two local functions 
depending on the spins in two disjoint subsets A resp. A' of A. Then 

K(f;g)\ < 6(f) 5(g) n\ x ,4 (A A' m A) 
where 5(f) = max^ /(£) — min^ /(£) is the total oscillation of f . 
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Proof: By rescaling and addition of suitable constants we can assume that < / < 1 
and < g < 1. Proposition 7.10| then shows that 



K(f;g)\< vlWgtf) 



Ma\a'(/)-Ma\a'(/) 
< 5(f) 5(g) J »l(d® J ^l(dO m!\ a , x /4\ A ,(A ^ A' in A) 

because £ = £' = r] on <9A. By carrying out the last integration we obtain the result. □ 

The bounds above leave us with the task of estimating the probability of disagreement 
paths in a duplicated system. In contrast to the situation in Section |7.1] , we are looking 
now for estimates valid at low temperatures. If a cluster expansion works, there is no 
need to look any further. For instance, a low temperature analysis and estimates of semi- 
invariants for the Ising model can be obtained using standard contour representations; 
see |3?|]. It needs to be emphasized, however, that the main step of cluster expansions 
consists in expanding the logarithm of the partition function. Only afterwards, by 
taking ratios of partition functions, does one obtain expressions for covariances and 
higher order correlation functions. Therefore, a point to appreciate is that the bound 



of Corollary 7.11 provides a direct geometric bound on covariances which avoids the 
machinery of cluster expansions and, in particular, the problems coming from taking 
logarithms. As a consequence, this estimate also applies to some cases where standard 
cluster expansions are doomed to fail. 

Let us illustrate this for the case of the Ising ferromagnet. This might not be the 
best example because other methods can also be applied to it; nevertheless, it is useful 
to demonstrate the technique in this simple case. Afterwards we will discuss a case 
where cluster expansion techniques cannot be used equally easily. 

Consider the low temperature plus phase lit of the ferromagnetic Ising model on 
the square lattice C = Z 2 with zero magnetic field. By taking the infinite volume limit 
in Corollary [7.11 with r\ = +1 we obtain the estimate 



\ni(f; 9 )\ < 5(f) 8(g) M + x M +(A M A') (54) 

for the covariance of any two local functions /, g with disjoint dependence sets A, A'. 

Next we observe that the event {A A'} is clearly contained in the event that there 
exists a path from A to A' along which (X, X') ^ (+,+). The latter event will be 

denoted by {A * < + 'V A'}. Now, Burton and Steif |J7j] have shown how to estimate 
the probability of this event to occur in the duplicated plus phase. The result is the 
following. 

Theorem 7.12 For the Ising ferromagnet on Z 2 at sufficiently large (3, there exist 
constants c > and C < oo (depending on (3) such that for any two disjoint sets 
A,A'e£, 

fi+ x fj+(A A') < Cmm{\diA\,\d t A'\} e - cd{A ' A,) , 

where <9jA = d(A c ) is the inner boundary of A. 

Combining this theorem with we obtain an exponential bound for the covariance 
of any two local observables in the low temperature Ising plus phase. While this result 
is well-known, its proof below shows how one can proceed in more general cases. 
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Sketch proof of Theorem |7.12| : It is sufficient to prove the statement with \it 
replaced by where A is a sufficiently large square box containing A, A'. For 

brevity, let Pa = l^t \ x Ms a- Suppose that \db!\ < \dA\, fix an arbitrary x € c\A', 

and suppose that the event {x <-^$ A} occurs. Let To = Tq(X,X') be the maximal 
connected set containing x on which (X, X') ^ (+,+). Also, let T be the union of To 
and all finite components of To c . T is enclosing in the sense that both T and T c are 
connected. In fact, T can be identified with a contour, the broken line which separates 
r from its complement. 

Consider the set C of all enclosing sets containing x and contained in A. For any 
integer £ > 2, let Ci be the set of all C E C such that \diC\ = £. Since the number 
of contours with length i surrounding a given site on the square lattice is bounded by 
£3 l and since each enclosing set with \d{C\ = £ uniquely defines a contour with length 
between £ and 4£, we have that \Ci\ < 3(4£ + l)/2(8l/ growing exponentially with £. 
Now we can write 

p A (x^ +) A) = p A (rnA^)< ]T ^p A (r = c). 

£>d(A,A') C&Ci 

Furthermore, for C G Q we have 

p A (r = C) < J2 ^0,a( X = ~ l on D,X = +1 on dC) 

DcdiC 

x /uJ A (X = -1 on diC \ D, X = +1 on dC) . (55) 



Now, a standard Peierls estimate (see e.g. ||213| 1) shows that 

i4A x - ~ l on jD ' x - +1 on dc ^ - e ~ c(/3) |D| ( 56 ) 

with constants c(/3),C(/3) independent of A satisfying c{(3) — > oo as (3 — > oo. Substi- 
tuting ( |56|) into (|55| ) we obtain the theorem by simple combinatorics and summations. 
□ 

We emphasize that the specific properties of the plus phase /it are used only in the last 
step, the Peierls estimate (|56|). Before, we needed only the Markov property. Therefore 
it is useful to note that the Peierls estimate is not limited to the Ising model; it remains 
valid under the conditions of the standard Pirogov-Sinai theory [213]. In particular, it 



follows that the results of Burton and Steif []37]] on the ergodic properties of the Ising 
model carry over to more general Markovian models of Pirogov-Sinai type. 

Let us finally discuss a case in which a Peierls estimate of the form (|5^ ) is not 
available. Namely, we ask for covariance estimates of local functions, still for the ferro- 
magnetic Ising model in a large square A, but now for some boundary condition rj not 
identically equal to +1. This question arises, for example, in the context of correlations 
atop of a disordered surface, or in the problem of establishing a Gibbsian description 
of non-Gibbsian measures. In fact, in [|167|| the method to be described below is used 
to prove that the projection to a line of the low-temperature plus phase of the two- 
dimensional Ising model is weakly Gibbsian. 

To be specific, suppose that the boundary condition rj on <9A is not identically plus 
but contains a large proportion of plus spins; we stipulate that rj = +1 on three sides of 
A while on the remaining side only a large fraction of the spins is plus. In this case, (|56|) 
cannot be true because D can be small and close to the boundary of A. For example, if 
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E = dD n <9A ^ and rj happens to be minus on E then it is not very unlikely that all 
spins in D are minus, and ( |56| ) will not hold. However, one can take advantage of the 
fact that for small D its complement in <9jC is large, and vice versa. In other words, to 
estimate the right-hand side of ( |55|) one should not apply (^) separately to each factor, 
but rather one can hope to estimate their product. 

To make these general remarks precise we consider the Ising model on the half-plane 
Z x Z + . For any n we consider the square A n = {(xi, X2) G Z 2 : — n < x\ < n, < X2 < 
n} touching the boundary line Zx {0}. On this line we fix a configuration £ G {+1, — 1} Z , 
thereby defining a boundary condition on one part of dA. On the remaining part of dA 
we impose plus boundary condition. That is, we choose the boundary condition n = +1 
on dA n \ (Z x {0}) and n = £ on Z x {0}. We ask for the correlation of the spins at the 
sites x = (0, 1) and y = (k, 1) with < \k\ < n. 

Theorem 7.13 In the situation just described, suppose £ is such that 

m — 1 

£ CO', °) > 8m / 9 and °) ^ 8m / 9 

3=0 j=-m 

/or sufficiently large m, and let n be defined as above. Then there are constants c > 
and C < 00 (not depending on n) such that 

l4 An (X(0,l);X(M))l < Ce" c l fc l 
whenever (3 and \k\ are sufficiently large and n > \k\. 

Sketch proof: We proceed as in the proof of Theorem [7.12 . In dealing with the 



right-hand side of (55) we must take into account that possibly dC n dA 7^ 0. We 
therefore replace dC by dC \ dA in the product term and also estimate the probabilities 
of intersections by conditional probabilities, yielding the upper bound 



'£{X = -1 on D) fJ,p$(X = -1 on fyC \ D) 



for the summands on the right-hand side of (|55|). Here, Mac st an ds for the Gibbs 
distribution in C with boundary condition equal to +1 on dC D A and equal to n on 
dCndA. To derive the theorem we need to replace the Peierls estimate (^) by a similar 
bound on the last product. The exponential decay of correlations then again follows by 
simple combinatorics and summations. 

To make the influence of the boundary condition rj explicit we exploit a contour 
representation leading to the estimate 

Hp${X = -1 on D) fip$(X = -1 on d t C \ D) 

< e n «*>(7) n ^(V) • (57) 

r,r' inside C j£T J f ^T f 

r compatible with D 
r' compatible with d^C\D 

The right-hand side is defined as follows. For any configuration a G {+1, — 1} A we draw 
horizontal resp. vertical lines of unit length between neighboring sites of opposite spins, 
doing as if the boundary spins were all plus; we then obtain a disjoint union of closed 
non-self-intersecting polygonal curves. Each of these curves is called a contour 7, and 
a set r of contours arising in this way is called compatible. We thus have a one-to-one 
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correspondence between spin configurations a and compatible sets Y of contours. If 
a = — 1 on D, then each component of D is surrounded by some contour 7 (i.e., belongs 
to the interior Int7 of 7); the smallest contours surrounding the components of D are 
collected into a set Y of contours. Each set Y arising in this way is called compatible with 
D. The probability that a given set Y of contours occurs is not larger than I^er w, n(7), 
where 



^(7) = exp 



2/3| 7 |+2/3 J2 E 0-~v(y)) 

rcGlnt 7 yGdA.y^x 



and I7I is the length of 7; this can be seen by comparing the probability of a configuration 
containing Y with the probability of the configuration obtained by flipping the spins in 
U 7e rl n t7- These observations establish the inequality (|5"7|), 

Note that the weight 10^(7) of a contour 7 depends on the boundary configuration 
77; this is because we have chosen to draw the contours for plus boundary conditions 
rather than 77. It follows that 7*^(7) does not necessarily tend to zero when I7I grows 
to infinity; this is in contrast with the case rj = +1. The standard low temperature 
expansion would therefore become much more complicated. However, if the density of 
plus spins in 77 is sufficiently large, or if dint 7 n dA is rather small, the standard weight 
exp[— 2/3|7|] of the Ising contours will dominate, and the right-hand side of ([57]) can be 
estimated, as we will show now. 

We unite the sets Y,Y' in ( |57| ) into a single set of contours Y = ruf. The contours 
in r can overlap, but a site of diC can only belong to the interior of at most two 
contours. On the other hand, every site of diC is in the interior of at least one contour 
of r, and \diC\ > k, the distance of the two spins considered. These ingredients allow 
us to control the sum on the right-hand side of (57). If k is so large that the density 



of plus spins in £ between and (k,0) exceeds 8/9 then we find for any collection of 
contours r = Y U Y' as above 



EE E a - V(V)) < 5/9 l7l 



This yields 



Y[w v (i) I] W) ^ II ex P[- 8 /9/3|7|] J] exp[-8/9/3|V|] . 
■y&r 7'er' 7er 7'er' 

At this point the standard arguments take over (with (3 replaced by 4/3/9), leading to 
an exponential estimate of (|57|). For example, one can conclude the proof along the 



lines of Lemma 2.5 of [37]. □ 
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8 Phase transition and percolation 



Typically, two ends of the phase diagram are amenable to mathematical analysis. One is 
the high temperature, or low density, regime which was discussed in the previous section 
and in which the system can be viewed as a small perturbation of an independent spin 
system. The other end is the low temperature regime which we will now consider. At 
low temperatures, the energy dominates over the entropy which comes from the thermal 
fluctuations of the spins. One therefore expects that the spin configuration is typically 
similar to some frozen zero temperature state, which is a configuration of minimal energy 
and thus called a ground state. The similarity of a low temperature state with a ground 
state is conveniently described in geometric terms: one imagines that the spins which 
agree with the given ground state form an infinitely extended sea, whereas those spins 
which have chosen to deviate from the ground state are confined to interspersed finite 
islands. This is, of course, a picture of percolation theory: spins that agree with the 
ground state form a unique infinite cluster. We are thus led to the concept of agreement 
percolation, which will be discussed in the first part of this section. 

In fact, agreement percolation is intimately related to the existence of a phase tran- 
sition. If several distinct ground states exist, we may hope to find at low temperatures 
also several equilibrium phases which can be distinguished by agreement percolation 
with respect to the different ground states. One may ask further whether the geo- 
metric picture that applies to low temperatures remains valid throughout the whole 
non-uniqueness region. Physically, this is a matter of stability of the ground states. 
Mathematically, it means to look for conditions under which distinct Gibbs measures 
allow distinct stochastic-geometric characterizations. 



We will approach this question from two different sides. In Sections 8.1 to |8.4] we 
investigate whether "phase transition implies percolation" . We study a fixed equilibrium 
phase [x in the non-uniqueness region which, by its very construction, can be viewed 
as a random perturbation of some ground state rj. We then will see that, in many 
cases, spins that agree with r\ do percolate. After a general discussion of agreement 
percolation in Section 8.1, we investigate this concept in the subsequent subsections 
for some specific models including the Ising ferromagnet and the Potts model. In the 
case of the planar lattice Z 2 with its limited geometric possibilities we will also see that 
conversely, the absence of phase transition sometimes implies an absence of percolation, 
and that in the case of phase transition one has restrictions on the number of phases. 
(Methodologically, these results still run under the heading "phase transition implies 
percolation".) In the last Section 8.5 the converse will be treated more systematically 
and under a different aspect: we will show that at low temperatures one has percolation 
of bonds along which the interaction energy is minimal, and we will see that such a 
ground- energy bond percolation often implies a phase transition. Taken together, these 
results will show that in various models phase transition comes along with the existence 
of a ground-state sea with finite islands (deviation islands) on which the spins deviate 
from the ground state, and vice versa. 

A theory developing this picture in much more detail is the Pirogov-Sinai theory of 
phase transition which deals with the low temperature phase diagram in the presence of 
several stable ground states. One basic idea of this theory is to treat the finite deviation 
islands of a low temperature system as the constituents of a low density gas of hard-core 
particles. While the Pirogov-Sinai theory is intimately related to the subject of this 
section, it is much too involved to be developed here. There is, however, a number 
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of expositions which may serve as general introductions and in which many additional 

H |7|, H|, HH and |{ 



to |230|. Here 



references can be found. We mention only 
we will concentrate on more specific results which are partly beyond the Pirogov-Sinai 
theory, in that they are not limited to low temperatures but rather apply to the full 
non-uniqueness region, and comment occasionally on some relationships. In particular, 
the results of Section 8.5 are similar in spirit to this theory. 



8.1 Agreement percolation from phase coexistence 

We consider again the general setting of Section ||. (£, ~) is an arbitrary locally finite 
graph, S is a finite set, and O = S c . Suppose /i is a random field and r\ £ Q, a fixed 
configuration. We consider the event {x < — > 00} that x £ C belongs to an infinite 
cluster of the random set R(rj) = {y £ £ : X(y) = n(y)}, and we say that [i exhibits 
agreement percolation for n if <— —* 00) > for some x £ C. In short, we will then 
simply speak of 77-percolation. To visualize such an agreement, it may be convenient to 
think of a reduced description of /i in terms of its image under the map s v : Q — > {0, 1} , 
which describes local agreement and disagreement with rj, and is defined by 

(S " ((J))(X) = { otherwise. (58) 

With this mapping, we can write {x < — > 00} = s~ 1 {x^oo}. 

We are interested here in the case when \i is a Gibbs measure for the Hamiltonian 
(||), and 7] is an associated ground state. We say that a configuration n £ Q is a ground 
state or, more explicitly, a ground state configuration for the relative Hamiltonian (||), 
if i?(cr|r/) > for any local modification (or "excitation") a of n. In other words, n is 
a ground state if, for any region A £ £, the configuration rj minimizes the energy in A 
when r/Ac is fixed. One should note in this context that, in the low temperature limit 
/3 T co, the finite- volume Gibbs distribution ^ A from (Q) tends to the equidistribution 
on the set of all configurations a of minimal energy H{a\rf). This fact suggests that, at 
least in some cases, the low temperature phase diagram is only a slight deformation of 
the zero-temperature phase diagram describing the structure of ground states. This is 
precisely the subject of Pirogov-Sinai theory which provides sufficient conditions for this 
to hold, proposes a construction of low temperature phases as perturbations of ground 
states, and also shows that the size distribution of the deviation islands has exponential 
decay. 

Suppose next that the Gibbs measure // is related to the ground state 77 in some way. 
For example, \x might be obtained as the infinite volume limit of the finite volume Gibbs 
distributions ^ A with boundary condition rj, possibly along some subsequence. (Under 
the conditions of the Pirogov-Sinai theory such a limit always exists.) In the case of a 
phase transition, when other phases than \x exist and one is interested in characteristic 
properties of /U, one expects that the relationship between /i and rj becomes manifest in 
a macroscopic pattern of the typical configurations, in that fi shows r/-percolation. In 
short, we ask for the validity of the hypothesis 

\Q{f3H)\ > 1, (j, is extremal in Q{f3H) and related to a ground state n £ £1 
=^ (j,(x A 00) > Vx £ C . (59) 

In the specific cases considered below it will always be clear in what sense ^ and 77 are 
related; typically, /x will be a limiting Gibbs measure with boundary condition n. We 
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emphasize that (|59|) does not hold in general; a counter-example can be constructed 
by combining many independent copies of the Ising ferromagnet to a layered system, 



see the discussion after Proposition S.3 below. Also, even when (59) holds, it does 



not necessarily imply that the phase is uniquely characterized by the property of 
//-percolation. 

How can one establish (59)? In the context of the Ising model, Coniglio et al. [S2| 
and Russo |208| developed a convenient citerion which is based on a multidimensional 



analog of the strong Markov property and thus can be used for general Markov random 
fields |36], 100 1 . One version is as follows. 

Theorem 8.1 Let (£, ~) be a locally finite graph, fi a Markov field on Q, = S , and 
7] £ £1 any configuration. Suppose there exist a constant c G R and a local function 
f : — > R depending only on the configuration in a connected set A, such that M/) > c 
but 

M/ | X = £ on dT) < c (60) 
for all finite connected sets T D A and all £ G £1 with Sr;(£) = on dT. Then 
MA <— — » oo) > 0, i.e., n exhibits agreement percolation for rj. 

Proof: Suppose by contraposition that //(A oo) = 0. For any e > we can then 



choose some finite A D A such that //(A 



A c ) < e. For £ <£ {A A A c }, there 



exists a connected set T such that A C T C A and s„(£) = on dT; we simply let T 
be the union of A and all //-clusters meeting <9A. As in the proof of Theorem 6.10| , we 
let r(£) be the largest such set. For £ G {A A c } we put r(£) = 0. Then, for each 
finite connected set r / 0, the event {£ : r(£) = T} depends only on the configuration 
in A\r, whence by the Markov property M/ | T(-) = T) is an average of the conditional 
probabilities that appear in assumption (|60|). ^From this we obtain 



□ 



< c/i(T(.) ^ 0) + MI/I I {r(-)=0}) < c + e 

Letting e — > we find M/) — c ) contradicting our assumption. 

In most applications we will have a natural candidate for the function /. Whenever 
distinct phases do exist, they can be distinguished by some order parameter, viz. a 
local function / having different expectations for the two phases. If, in addition, some 
stochastic monotonicity is available then we can hope to establish (p0|). In fact, the 



percolation phenomena stated in Examples 5.11 to 5.15 can be deduced from a slight 
modification of Theorem 8.1. We will not go into the details of these examples which 



are treated in 36], but rather apply Theorem 8.1 to our standard examples. 



8.2 Plus-clusters for the Ising ferromagnet 



The idea of agreement percolation was first developed in the context of the ferromag- 
208 1 . Let us apply Theorem |S.1| to this standard case. We only 

0, so the only parameter is the 



netic Ising model |6S 



consider the case of no external field, i.e., we set h 
inverse temperature (3 > 0. We are interested in agreement percolation for the constant 
configurations rj = +1 resp. r] = —1, which are the only periodic ground states of the 
model. We write < — ► resp. < — > for the corresponding connectedness relation. Our first 
result shows that if there is a phase transition then there is plus-percolation for each 
Gibbs measure except the minus-phase that is, assertion (|59|) holds for rj = +1. 
This result (due to |208j 1 ) is valid for an arbitrary locally finite graph (£, ~) with finite 
critical inverse temperature (3 C . 
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Theorem 8.2 Let fi be an arbitrary Gibbs measure for the ferromagnetic Ising model 
with parameters (3 > 0, h = 0. If /j, ^ \i2 then [i{x <-^-> oo) > for all x G C. In 
particular, if (3 > j3 c then n~t(x < — ► oo) > /or a// x G C. 

Proof: By the sandwiching inequality ( Jl5| ) and Proposition 4.12, there exists a site 



x G £ such that fi(X(x) = 1) > c = «g (^(^) = !)■ On the other hand, the analogue of 
inequality (|l4|) for the minus boundary condition shows that 

fj,(X(x) = l\X = -londT) = Hp T {X(x) = 1) < c 



for every finite T 3 x. Theorem 3.1 thus gives the result for the x at hand. In view of 



the finite energy property of this extends easily to all other x G C. □ 



Let us rephrase the last statement of Theorem 3.2 as follows: below the critical tem- 
perature the plus spins percolate in the plus phase and, by symmetry, the minus spins 
percolate in the minus phase fj,^. In the case of graphs with symmetry axes, this state- 
ment allows an interesting refinement. 

Proposition 8.3 Suppose (£, ~) admits an involutive graph automorphism r which 
maps a subset 7i C C onto its complement TL C , and that for x G TL, y G TL C either x </■ y 
or y = rx. For x G TL let {x oo in TL} be the event that there exists an infinite path 
7 inH starting from x such that all spins along both 7 and its reflection image rj are 
positive. If fig 7^ fip then l^t(x <— ^— > 00 in TL) > for all x eH. 

One natural case to think of is when C = Zi d for d > 2, TL a halfspace with boundary 
orthogonal to an axis, and r the associated reflection. The proposition then asserts that 
/i^-almost surely there exists an infinite connected mirror-symmetric set of plus spins. 
Another interesting case is when C consists of two disjoint copies of a graph TL which 
are not connected to each other by any bond. In this case, /J,~t = X Mg-^, and 

the statement is that two independent realizations of A^-ft exhibit simultaneous plus- 
percolation; in this case, the preceding proposition was observed by Giacomin et al. 
[|100f1 . It is, however, not possible to take an arbitrarily large number k of independent 
realizations Xi, ... , Xk of /^g W , at least when TL has bounded degree N. For, if p c is 
the Bernoulli site percolation threshold of TL and k is so large that 

SUp/l} f x \( X ( X ) = l f <Pc 

then the set {x G TL : X\(x) = ■ ■ ■ = Xk{x) = 1} does not percolate. This follows from 
a standard domination argument. Since the layered system consisting of k independent 
copies of the Ising model with (3 > (3 C certainly exhibits a phase transition, we see that 
hypothesis ( |59"| ) does not hold in general. 

Proof of Proposition 8.3: We identify each £ G with (£(x), £,{rx)) x( zji G S , 



where S = {—1, l} 2 . The event under consideration then corresponds to ^-percolation 
for the configuration r\ G with i](x) = (1, 1) for all x G TL. Let / = X(x) + X(rx). 
Then fJ^(f) = 2 fi^(X(x)) > by the r-invariance of fit. On the other hand, let T C TL 

be a finite set containing x, and T = T U rT. If (£,£') G = Q with s^^,^') = on 
d n T = drnTL, then £' < -£ on d H T, and therefore (£, £') < (£, -£) (as elements of O) 
on dT = dyT U r d-j-cT. We can thus write 

H+(f\(X,X') = &?)cmdr) = ^f(X(x))+^f(X(rx)) 

< ^\x{x))+$f Q (X(rx)) = 
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by Lemma 4.13 and the symmetry under r and simultaneous spin flip. The proposition 
thus follows from Theorem 3.1. □ 



In the remaining part of this subsection we consider the case of the square lattice 
C = Z 2 , in which we can obtain much stronger conclusions. The following result gives a 
complete characterization of the non-uniqueness regime of the parameter space in terms 
of percolation of plus spins in the Gibbs measure fit. It is due to Coniglio et al. J62| ; 
see also [ 131 1 . 



Corollary 8.4 For the Ising ferromagnet on the square lattice Z 2 with no external field 
and inverse temperature (3, the fit -probability of having an infinite plus-cluster is in 
the uniqueness regime (3 < (3 C , and 1 in the non-uniqueness regime (3 > (3 C . 

Proof: The existence of an infinite cluster of plus spins is a tail event and thus, 
by the extremality of fit, has probability or 1. The case [3 > /3 C is thus covered 
by Theorem 3^. For j3 < j3 c , fit coincides with fi^. Thus, if an infinite plus-cluster 



existed with probability 1 then, by symmetry, an infinite minus-cluster would also exist, 
in contradiction to Theorem |5.18| ; the assumptions of this theorem are satisfied by 
Proposition 4.14 . □ 

Combining the corollary above with Proposition 4.16| , we can also obtain some bounds 
for the percolative region of the Ising model for h ^ 0; see for a detailed discussion. 

The equivalence of non-uniqueness and percolation just observed for the Ising model 
on Z 2 cannot be expected to hold for non-planar graphs. Consider, for example, the 
Ising model on the cubic lattice Z 3 . For [3 = uniqueness certainly holds, and plus- 
percolation is equivalent to Bernoulli site percolation on Z 3 with parameter 1/2. But a 
result of @ states that p c (Z 3 ) < 1/2. The plus spins thus percolate at [3 = 0. In view 



of Proposition 4.1£, this is still the case for sufficiently small f3, so that plus-percolation 
does occur in a non-trivial part of the uniqueness region. 

For the planar graph Z 2 , however, Theorem 5.18 does not only imply the equivalence 



of phase transition and percolation, but also gives some information on the number of 
phases in the non-uniqueness region. As a warm-up let us show that, for the Ising 
ferromagnet on Z 2 at inverse temperature (3 > f3 c , there are no other translation and 
rotation invariant extremal Gibbs measures than fi^ and fi^. For, suppose another 
such phase fi existed. By Theorem |8.2j and the Burton-Keane uniqueness theorem 5.17, 



there exist unique infinite plus- and minus-clusters with ^-probability 1. As an extremal 



Gibbs measure, \x has positive correlations; recall the paragraph below Proposition 4.14 



Proposition |5.19| thus shows that fi cannot exist. 

The statement just shown is a weak version of the following result which characterizes 
all translation invariant Gibbs measures. In fact, it is sufficient to assume periodicity, 
which means invariance under the translation subgroup (0 x ) x epZ 2 f° r some p > 1. 

Proposition 8.5 Any periodic Gibbs measure \i for the Ising ferromagnet on C = Z 2 
with no external field and inverse temperature [3 > [3 C is a mixture of the two phases fi^ 
and ftp. 

Under the condition of translation invariance, this proposition was first derived for 
large (3 by Gallavotti and Miracle-Sole p8j, and later for all (3 > f3 c by Messager and 



Miracle-Sole [177] using some specific correlation inequalities; it follows also from the 



Onsager-formula for the free energy density and a result of Lebowitz [152|. We will give 
a geometric proof below. 
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Remarkably enough, one can go one step further: each (not necessarily periodic) 
Gibbs measure for the Ising model on the square lattice is a mixture of the plus-phase 
and the minus-phase, and thus automatically automorphism invariant. This beautiful 



result was obtained independently by Aizenman H and Higuchi [130] based on the 
work of Russo [|2Q8[] . For more general two-dimensional systems the absence of non- 
translation-invariant Gibbs measures at sufficiently low temperatures was proved in 
[ 39 1 . In three or more dimensions, however, non-translation invariant phases of the 
Ising model do exist; this is a famous result of Dobrushin [66], see also [^] for a short 
proof. 

Theorem 8.6 (Aizenman— Higuchi) For the Ising ferromagnet on C = Z 2 with no 
external field and inverse temperature f3 > j3 c , \i% and are the only phases, and any 
other Gibbs measure is a mixture of these two. 

The proof is a masterpiece of random-geometric analysis of equilibrium phases and 
contains various ingenious ideas, but unfortunately it is too long to be sketched here. 
For the full result we thus need to refer to the original papers cited above, as well as 
to the survey [01. However, to provide an idea of some of the geometric ideas involved 



we will now give a (new) geometric proof of Proposition This proof resulted from 
discussions of the first author with Y. Higuchi. 

In this proof we need to consider infinite clusters in halfplanes. Here, we say that a 
set TL C Z 2 is a halfplane if TL is a translate of either the upper halfplane {x = (x±,X2) G 
Z 2 : X2 > 0} or its complement, the lower halfplane, or a translate of the right and left 
halfplanes which are similarly defined. The next lemma provides a first step in the proof 



of Proposition 8.5 



Lemma 8.7 Consider the Ising ferromagnet on Z 2 , and let D be the event that for at 
least one halfplane TL ml? , both TL and TL C contain an infinite cluster of the same sign. 
Then fi(D) = 1 for all n G 9((3H) and j3 > (5 C . 

Proof: Since each Gibbs measure is a mixture of extremal Gibbs measures, we only 
need to show that fi(D) = 1 for any extremal /x. Suppose the contrary. Since D is tail 
measurable, it then follows that n(D) = for some extremal fi. We will show that this 
is impossible. 

Step 1: Let TL be any halfplane, r the reflection of Z 2 which maps TL onto TL C , and 
t : o~ — ► —a the spin flip on Vl. We show that [i = [i o r o r. Since fJ-(D) = 0, at least 
one of the halfplanes TL and TL C contains no infinite minus-cluster, and this or the other 
halfplane contains no infinite plus-cluster. In view of the tail triviality of fx, we can 
assume that TL contains no infinite minus-cluster /U-almost surely. Hence, for any given 
A G £ and //-almost every £ G $7, there exists an r-symmetric region r(£) G £ such that 
r(f) D A and £ = 1 on dT(£) n TL. The last property implies that £ >z r org) on dT((), 



and using Lemma |4.13| and the flip-reflection symmetry of H we find that 



/4,r«) ~ v = Mflr K) r o r on F A . 

Assuming that T(£) is maximal in a large box A D A, we can apply the Markov property 
of fj, in the same way as in the proof of Theorem ^Tl]. This yields that /i >zx> M ° r ° T 
on for any A, and thus [i ^-p fx o r o r. (The preceding argument is a variant of 



an idea of Russo [ 208 1 .) Using the absence of infinite plus-clusters in TL or TL C we find 



analogously that fi -<x> pror. Hence /i = jjororas claimed. 
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Step 2: Here we use a variant of Zhang's argument which was explained in the 



proof of Theorem 5.18. To begin, we observe that the composition of two reflections in 
parallel axes is a translation. Step 1 therefore implies that \x is periodic with period 2. 
The flip-reflection symmetry of /x implies further that n is different from [iL and [iZ , so 
that (by Theorem |3.2| ) there exist both an infinite plus- and an infinite minus-cluster 
/i-almost surely. By the Burton-Keane uniqueness theorem 5,17| , these infinite clusters 



are almost surely unique. We now choose a square A = [— n,n — l] 2 n Z 2 so large that 
fi(A <— — > oo) > 1 — 10~ 3 . Since \x is extremal, fi has positive correlations. By the 
argument leading to ( |23| ) we thus obtain that fi(df.A > oo) > 1 — 10~ 3 / 4 for some 
k G {1, . . . ,4}, where d^A is the intersection of dA with the fc'th quadrant (relative to 
the axes {x2 = —1/2} and {x\ = —1/2}). For definiteness, we assume that k = 1. By 
the flip-reflection symmetry, it follows that the intersection 

{d\A oo, e^A < — > 00, d%A 00, O4A < — > 00} 

has probability at least 1 — 4 • 10~ 3 / 4 > 0, which is impossible because of the uniqueness 
of the infinite clusters. This contradiction concludes the proof of the lemma. □ 



Proof of Proposition 8.5: Let \i be any Gibbs measure invariant under (0 x ) x£p z2 for 



some p > 1. Using the ergodic decomposition, we can assume that \i is in fact ergodic 



with respect to this group of translations. By Lemma 8.7 , there exists a pair (TC,Ti c ) of 
halfplanes such that, with positive probability, both H and 7i c contain infinite clusters 
of spins of the same constant sign. For definiteness, suppose TL is the upper halfplane, 
and the sign is plus. In view of the finite energy property, it then follows that also 
/i(Ao) > 0, where for k £ Z 

A k = {(k, 0) ^ 00 both in H and H c ] . 

Let A be the event that A k occurs for infinitely many k < and infinitely many k > 0. 
The horizontal periodicity and Poincare's recurrence theorem (or the ergodic theorem) 
then show that h(Aq \ A) = 0, and therefore fJ,(A) > 0. 

Next, let B be the event that there exists an infinite minus-cluster. We claim that 
/i(AnB) = 0. Indeed, suppose fi(Ar\B) > 0. Since A is tail measurable and horizontally 
periodic, we can use the finite energy property and horizontal periodicity of [i as above 
to show that the event 

C = A n {(k, 0) < — ► 00 for infinitely many k < and infinitely many k > 0} 

has positive probability. But on C there exist infinitely many minus-clusters, which is 
impossible by the Burton-Keane theorem. 

To complete the proof, we note that n(B) < ^{A c ) < 1, and thus n(B) = by 
ergodicity. In view of Theorem |8.2j , this means that {i = fip. In the case considered, 
the proposition is thus proved. The other cases are similar; in particular, in the case of 
negative sign we find that [i = [iZ. □ 



8.3 Constant-spin clusters in the Potts model 



Consider the g-state Potts model on the lattice C = Z d introduced in Section 



d > 2, and recall the results of Section 6.2 on the phase transition in this model 



The periodic ground states are the constant configurations rji = i, 1 < i < q. We 
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write <— ^— > for the agreement connectivity relation relative to T)i, and we consider the 
limiting Gibbs measure ^ q at inverse temperature (3 associated to r/j, which exists by 
Proposition 6J] . As a further illustration of assertion (|59f ) , we show that fjf^ exhibits i- 
percolation whenever there is a phase transition. This is a Potts-counterpart of Theorem 



8.2. For its proof, we use the random-cluster representation rather than Theorem 3.1 



because for q > 2 there is no stochastic monotonicity available in the spin configuration. 



Theorem 8.8 For the Potts model on Z d at any inverse temperature (3 with \Q(f3H)\ > 
1, we have ^ (x <— — > oo) > for all x G Z rf and i G {1, . . . , q}. 

Proof: By translation invariance we can choose x = 0. In Theorem 6.1C| we have seen 
that 0p )9 (O <-> oo) = c > for f3 > (3 C , where p = 1 — e -2 ^ 3 as usual. In view of (|30|), this 
means that ^ A (0 <-> A c ) > c for all A B 0. But for the Edwards-Sokal coupling A 

of ^ A and 4>pqh (defined before Proposition |6.9| ) we have {0 «-> A c } C {0 «-^-> A c } 

almost surely, so that ^ q j^(0 A c ) > c. In particular, /j^ A (0 > A c ) > c 

whenever G A C A. Letting first A f Z d and then A | Z d we find that ^ q (0 
oo) > c, and the theorem follows. □ 

Next we ask for a converse stating that "agreement percolation implies phase transition" . 
As we already noticed in the case of the Ising model, this can be expected to hold only 



in the case of a planar lattice. But then a counterpart of Corollary 8.4 does indeed hold 



as was shown by L. Chayes [56]. 



Theorem 8.9 For the unique Gibbs measure fj,g q of the q-state Potts model on the 
square lattice Z 2 at inverse temperature f3 < f3 c , we have /U/3, g (3 an infinite i-cluster) = 
for alii € {1, . . . , q}. 



The strategy of proving this theorem is the same as that in the proof of Corollary |3.4| . 
Suppose the i-spins percolate in \ig„ for some i. Then, by symmetry, this holds for 
all i, so that in particular the 1-spins and the other spins percolate. Hence, Theorem 
5.18| leads to a contradiction, provided we can show that the set of l's has positive 



correlations. Theorem 8.9 thus follows from the following lemma. 



Lemma 8.10 Consider the phase ^ q of the q-state Potts model at any inverse tem- 
perature (3 > 0, and let the mapping Sj be defined by (58) with rj = r\i, i G {1, . . . , q}. 



Then the measure vp s = o s i 1 has positive correlations. 

Sketch of proof: By symmetry, vn q does not depend on i. For definiteness we set 
i = 1 in the following. Since the property of positive correlations is preserved under 
weak limits, it is sufficient to consider the finite volume Gibbs distribution /x^ A and its 
image ^/3,q,A = q a 05 ]^- By the FKG inequality, Theorem 4.11 , it is further sufficient 
to show that ^/3, q ,A 1S monotone. In terms of /x^ ?A an( ^ ^ ne random field Y = si(X), 
this means that the conditional probability 

is increasing in £ G {0, l} z<i for any x G A. Since the boundary condition is fixed to be 
equal to 1 off A, we can assume that £ is equal to 1 off A, and it is sufficient to prove the 
inequality q x (£) < Qx((,') for any two such £, £' that differ only at a single site y G A and 
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are such that £(y) = and £'(?/) = 1. For such the inequality q x (£) < q x (£') simply 
means that Y(x) and Y(y) are positively correlated under the conditional distribution 

Px,v\t of v},q,A g iven that Y = € °ff 

To show this we fix x, y, £. For ^ x ,y\£i we have X = 1 on the complement of 
A = {x,y} U{t) £ A \ {^,y} : £(v) = 0}. We thus consider the graph G with vertex 
set A and edge set 13(A) consisting of all edges of B with both endpoints lying in A. If 
we knew that Y(x) = Y(y) = 0, then H X y\£ would be the distribution of a (q — l)-state 
Potts model on G with state space {2,3, . . . ,q}. Now that we don't know Y(x) and 
Y(y), fJ> x ,y\£ is still a modification of this (q — l)-state Potts model, in which x and y 
are allowed to have the g'th spin value 1. 

To describe this modification we suppose first that x and y are not adjacent. Let 
n x be the number of neighbors v of x with £(v) = 1, and define n y accordingly. The 
probability weight of /J> x ,y\£ then contains the additional biasing factor 

exv[2p(n x I {x{x)=1} + n y I {x{y)=1} )} 

which acts like an external field at x and y. For this modified Potts model, we can 
still define a modified random-cluster representation which gives any edge configuration 
C G {0, a probability proportional to 

(q- l) k ^\q- l + e 2f3n *) k *tt\q- l + e 2 ^)^(0 J] p« e )(l - p) 1 "^ 6 ) . 

eeS(A) 

Here p = 1 — e~ 2 ^, k(C) is the number of connected components excluding singletons at 
x or y, and k x (Q and k y (Q are the indicator functions of having a singleton connected 
component at x resp. y. A spin configuration with distribution H X y\£ is then obtained 
from the edge configuration by assigning spins at random uniformly from {2, . . . , q} 
to connected components, except for a singleton at x, where the spin is taken from 
{1, . . . , q} with probabilities proportional to (e 2 ^™ 1 , 1, . . . , 1), and similarly for a sin- 
gleton at y. Just as in Corollary this representation gives the desired positive 
correlation of Y(x) and Y(y) under fi x ,y\£, provided we can show that k x and k y are 
positively correlated in the modified random-cluster model. Since these indicator vari- 
ables are decreasing, it suffices to check that the modified random-cluster model has 



positive correlations, which follows from Theorem 4.11 by verifying that it is monotone; 



this, however, is similar to Lemma 6.6 



The case when x and y are neighbors is handled analogously; in fact, the positive 
correlation can only become stronger when x and y have an edge in common. □ 

8.4 Further examples of agreement percolation 

Here we treat the Ising antiferromagnet, the hard-core lattice gas, and the Widom- 
Rowlinson lattice model, and shortly mention the Ashkin-Teller model. 



The Ising antiferromagnet. Consider the setting of Section 3.2. We need to assume 
that the underlying lattice C is bipartite, and thus splits off into two parts, C even and 
C dd- If \h\ < 2d, there exist two periodic ground states, rj even and rj dd = —r/even, where 
Veven = 1 on C even and rjeven = — 1 on C dd- (There are no other periodic ground states, 



see for example [pq| .) The phase transition in this model has been studied in [65] and 
|129(| . Because of the bipartite structure, we can flip all spins on a sublattice as in 
(y), which turns the model into an Ising ferromagnet in a staggered magnetic field of 
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alternating sign on C even and jC dd- The latter model still satisfies the FKG inequality. 



As pointed out in Section |3.2| , for h = there is a one-to-one correspondence between all 
Gibbs measures for the Ising ferromagnet and the Ising antiferromagnet. In particular, 
both models then have the same critical inverse temperature f3 c . For general \h\ < 2d, we 
still have two limiting Gibbs measures /jP^ ven and /i^ ocW , and these measures have positive 
correlations relative to the "staggered" ordering cr ^ a' iff a(x)r] even (x) < a' (x)r} even (x) 
for all x G C. Relative to this ordering, an analogue of the sandwiching inequality (|i"5|) 
holds; for more details see Section 9 of [198]. Here is a version of statement (|59| ) for this 
model. 



Theorem 8.11 Consider the Ising antiferromagnet on a bipartite graph (C, < 
external field h at any inverse temperature (3 > 0. If \Q((3H)\ > 1, we have 



in an 



T)even 

M/3 



(X < — ► OOJ > 



for all x £ C. 



This follows from Theorem ^Tl] in the same way as Theorem ^2 
obvious counterparts of Corollary 



For £ = Z 2 , the 
and Proposition |8Jj are also valid since the proofs 
of these results carry over to the case of a staggered external field. 



The hard-core lattice gas. As we have seen in Section |3.4j , this model has state space 
S = {0,1} and corresponds to setting U(a,b) = oo/{ a= b =1 } and V(a) = —a log A in 
(H), a, b G S; A > is an activity parameter. The hard-core model is the limit of the 
Ising antiferromagnet for (3 — > oo and h — > 2d along (3 (2d — h) = ^logA, provided a 
configuration a G {—1, +1}^ is mapped to (1 — <r)/2 G {0, 1}^; see |^] for details. (The 
phase diagram point h = 2d, (3 = +oo of the Ising antiferromagnet is highly degenerate 
since there are infinitely many, in general nonperiodic, ground states.) For C = Zi d , 
the hard-core lattice gas can be seen as a gas of hard (i.e., non-overlapping) diamonds. 
In general, we still assume that C is bipartite. For A > 1, the hard-core model then 
has two periodic ground states of chessboard type, namely rj even which is equal to 1 on 
C even and otherwise, and ry G ^ = 1 
limiting Gibbs states ^ en and fi^f' 



- Veven ■ As noticed in Section |4.4| , the associated 
exist. So, following the program stated in (|5^), 
we may ask whether these Gibbs measures exibit agreement percolation in the case of 
phase transition. The answer is again positive: 



Theorem 8.12 For the hard-core model on a bipartite graph C we have for any activity 
A > 0: Iffi e x ven + n° x dd then ^"(jfc oo) > for all x G C, and similarly with 'odd' 
in place of 'even 7 . 



This result is completely analogous to Theorem 3.11 , and was conjectured by Hu and 



Mak 135 , 136 1 from computer simulations. In these papers, the authors also discuss 
the case of hard-core particles on a triangular lattice, the hard hexagon model. While 
Theorem |8.12 does apply to the hard triangle model on the hexagonal lattice (which is 
bipartite), the non-bipartite triangular lattice with nearest-neighbor bonds is excluded. 



The results of [135, |136[| suggest that Theorem S.12 still holds for the triangular lattice. 
A geometric proof of this conjecture would be of particular interest. 

The hard-core model on the square lattice Z 2 admits an analogue to Corollary |S.4| , 
in that nonuniqueness of the Gibbs measure is equivalent to r/ e?)eri -percolation for the 



Gibbs measure [i e x en ; see [100] or |117| ] for more details 
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The Widom-Rowlinson lattice model. Consider the set-up of Section 3J3, with equal 
activities A + = A_ = A > for the plus and minus particles. For A > 1 we have two 
distinct periodic ground states r/ + = +1 and r/_ = —1. From Section 4.4 we know 
that the associated limiting Gibbs measures /i^ = lim^c and ^ exist. Moreover, 
Theorem |4.17 asserts that a phase transition occurs for some activity A if and only 
if n\{X(x) = 1) > n~t{X{x) = — 1) for some x 6 C. Now, it turns out that in 
this model not only hypothesis ( |59| ) holds, but that the nonuniqueness of the Gibbs 
measure is in fact equivalent to agreement percolation, not only for the square lattice 
but for any graph. This comes from the nature of the random-cluster representation of 



Section 6.7, which is related to the sites rather than the bonds of the lattice, and is a 



curious exception from the fact that, on the whole, the Widom-Rowlinson model is less 
amenable to sharp results than the Ising model. However, by the reasons discussed in 



Section 6.7, this result does not carry over to the multitype Widom-Rowlinson lattice 
model with q > 3 types of particles. 

Theorem 8.13 Consider the Widom-Rowlinson lattice model on an arbitrary graph 
(£,~) for any activity A > 0. Then the following statements are equivalent. 

(i) The Gibbs measure for the parameter A is non-unique. 

(ii) n\{x oo) > for some, and thus all x G C 

Sketch of Proof: Consider f° r some finite A. In the same way as the random- 



cluster representation of Section l^lj was modified in Section p.2| to deal with boundary 
conditions, we can modify the site-random-cluster representation of Section [O] to obtain 
a coupling of fj? x + A and a wired site-random cluster distribution ip^ 2 A , so that analogues 
of Propositions 3.22 and 6.25 hold. As a counterpart to equation (^2|) and by the specific 
nature of the site-random-cluster representation, we then find that 



»l\{X(x) = 1) - n\\{X{x) = -1) = ^ 2A (x ~ dA) = ^ A (x A 0A) 
for all x £ C. In the limit A j C we obtain by an analogue to (|30|) 



nt(X(x) = 1) - Ai+(X(x) = -1) = n$(x ^ oo) 



and the theorem follows immediately. 



□ 



To conclude this subsection, we note that hypothesis ( |59| ) also holds in other models. 
We mention here only the Ashkin-Teller model juj], a 4-state model which interpolates 
in an interesting way between the 4-state Potts and the so called 4-state clock model, 
which is also accessible to random-cluster methods; we refer to [58|, |6l|, 192, 210]. 



8.5 Percolation of ground-energy bonds 

So far in this section we considered a number of models which are known to show a phase 
transition, and asked whether this phase transition goes hand in hand with agreement 
percolation. These results run under the heading "phase transition implies percolation" , 
even though for the square lattice we established results of converse type coming from 
the impossibility of simultaneous occupied and vacant percolation on Z 2 . 

We now take the opposite point of view and ask whether "percolation implies phase 
transition" . More precisely, we want to deduce the existence of a phase transition (at low 
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temperatures or high densities) from a percolation result. In fact, such an idea is already 
implicit in Peierls' |187| 1 and Dobrushin's (6^] proof of phase transition in the Ising 
model, and is an integral part of the Pirogov-Sinai theory. For models with neighbor 
interaction as in the Hamiltonian (|l]), the underlying principle can be sketched as follows. 
At low temperatures (or high densities), each pair of adjacent spins (or particles) tries 
to minimize its pair interaction energy. Note that this minimization involves the bonds 
rather than the sites of the lattice. So, one expects that bonds of minimal energy - the 
ground- energy bonds - prevail, forming regions separated by boundaries that consist of 
bonds of higher energy. Such boundaries, which are known as contours, cost an energy 
proportional to their size, and are therefore typically small when (5 is large. This implies 
that the ground-energy bonds should percolate. Now, the point is that if the spins along 
a bond can choose between different states of minimal energy then this ambiguity can 
be transmitted to the macroscopic level by an infinite ground-energy cluster, and this 
gives rise to phase transition. In other words, the classical contour argument for the 
existence of phase transition can be summarized in the phrase: ground-energy bond 
percolation together with a (clear-cut) non-uniqueness of the local ground state implies 
non-uniqueness of Gibbs measures. We will now describe this picture in detail. 

We consider the cubic lattice C = Z rf of dimension d > 2 with its usual graph 
structure. For definiteness we consider the Hamiltonian (|l|) for some pair potential 
U : S x S —* R. We can and will assume that the self-potential V vanishes; this is 
because otherwise we can replace U by 

U'(a,b) = U(a,b) + ^[V(a) + V(b)} , a,b G S, (61) 

which, together with the self-potential V = 0, leads to the same Hamiltonian. Let 

m = min U(a, b) (62) 

a,b£S 

be the minimal value of U. 

Given an arbitrary configuration a G we will say that an edge e = {x,y} G B 
is a ground-energy bond for a if U(a(x),o~(y)) = m. The subgraph of Z rf consisting of 
all vertices of Z rf and only the ground-energy bonds for a splits then off into connected 
components which will be called ground-energy clusters for a. We are interested in the 
existence of infinite ground-energy clusters, and we also need to identify specific such 



clusters. Unfortunately, the Burton-Keane uniqueness theorem p. 17| does not apply here 
because, for any Gibbs measure, the distribution of the set of ground-energy bonds fails 
to have the finite-energy property. We therefore resort to considering ground-energy 
clusters in any fixed two-dimensional layer of Z rf ; the uniqueness of planar infinite 
clusters can be shown in our case. (An alternative argument avoiding the use of planar 
layers but requiring stronger conditions on the temperature has been suggested in f87j.) 
In fact, we have the following result. 

Theorem 8.14 Consider the Hamiltonian (Q) on the lattice £ = Zi d , d > 2, with 
neighbor interaction U and no self-potential, and let V be any planar layer in C. (So 
V = C for d = 2.) If (3 is large enough, there exists a Gibbs measure fj, G Q{(5H) which 
is invariant under all automorphisms of £ and all symmetries of U such that 

fj,(3 a unique infinite ground-energy cluster in V) = 1 . 

In the above, a symmetry ofU is a transformation r of S such that U(ra,Tb) = U(a,b) 
for all a, b G S; such a r acts coordinatewise on configurations. 



87 



Theorem p,14| is a particular case of a result first derived in []92j] and presented in detail 
in Chapter 18 of [^]. We will sketch its proof below. The remarkable fact is that this 
type of percolation often implies that fi has a non-trivial extremal decomposition, so 
that there must be a phase transition. In fact, this happens whenever the set 

Gu = {(a,b) e 5x5: U(a, b) = m} (63) 

of bond ground states splits into sufficiently disjoint parts. To explain the underlying 
mechanism (which may be viewed as the core of the classical Peierls argument, and a 
rudimentary version of Pirogov-Sinai theory) we consider first the standard Ising model. 



Example 8.15 The Ising ferromagnet at zero external field. In this model, we have as 
usual 5 = {—1, 1}, U (a, b) = —ab for a,b £ 5, m = —1, and G\j = {(—1, —1), (1, 1)}. 
Hence, either all spins of a ground-energy cluster are negative, or else all these spins 
are positive. In other words, each ground-energy cluster is either a minus-cluster or a 
plus-cluster. This implies that 

{3 a unique infinite ground-energy cluster in V} C A- U A + , 

where A- and A + are the events that there exists an infinite cluster of negative, resp. 
positive, spins in V . For the Gibbs measure \i of Theorem 8.14| we thus have n(A- U 



A+) = 1 and, by the spin-flip symmetry of U and thus fi, fi{AJ) = fi(A+). Hence 
fi(AJ) > and fi(A+) > 0, so that the measures fi~ = fi(- |A_) and /i + = //(• \A+) 
are well-defined. Since A_, A + are tail events, it follows immediately that fjT , fi + are 
Gibbs measures for (5H. Also, A^ n A + is contained in the event that there are two 
distinct ground-energy clusters in V, and therefore has /U-measure 0. Hence /z~ and /i + 
are mutually singular, whence \Q(f3H)\ > 1. 

The same argument as in the preceding example yields the following theorem on phase 
transition by symmetry breaking. A detailed proof (in a slightly different setting) can 
be found in Section 18.2 of 



Theorem 8.16 Under the conditions of Theorem \8. 14 suppose that the set G\j defined 



by Ip 30 admits a decomposition Gjj = G% U . . . U Gn such that 

1. the sets G ri , 1 < n < N, have disjoint projections, i.e., if {a, b) G G n , (a', b') £ G n > , 
and n 7^ n' , then a 7^ a',b ^ b' , and 

2. for any two indices n, n' E {1, . . . , iV} we have f{G n ) = G n i for some transforma- 
tion f of S x 5 which is either the reflection, or the coordinatewise application of 
some symmetry of U, or a composition of both. 

Then, if (3 is sufficiently large, there exist N mutually singular Gibbs measures /x 1 , . . . fi 
G Q((3H), invariant under all even automorphisms of Z d and such that 

ii n (3 an infinite n-cluster in V) = 1 

for all 1 < n < N . In particular, there exist N distinct phases for (3H. 
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In the statement above, an infinite n-cluster for a configuration a is an infinite cluster of 
the subgraph of Z d obtained by keeping only those edges e£B with (a(x),a(y)) G G n , 
where x is the endpoint of e in the even sublattice j£ even and y £ H dd is the other 
endpoint of e. Also, an even automorphism of Z d is an automorphism leaving C even 
invariant. 

We illustrate this theorem by applying it to our other standard examples. 

Example 8.17 The Ising antiferromagnet in an external field. We have again S = 
{— 1, 1}, but the interaction is now U(a,b) = ab — A(a + b) for some constant h € R. 
(Here we applied the transformation (]6l|).) If \h\ < 2d then m = — 1 and Gjj = 
{(— 1, 1), (1, —1)}. G\j splits up into the singletons G\ = {(1, —1)} and G2 = {(—1, 1)}- 
This decomposition meets the conditions of the theorem; in particular, G\ and G2 are 
related to each other by the reflection of S x S. Consequently, there exist two mutually 
singular Gibbs measures fi 1 and n 2 which are invariant under even automorphisms and 
have an infinite cluster of chessboard type, either with plus spins on the even cluster 
sites and minus spins at the odd cluster sites, or vice versa. 

Example 8.18 The Potts model. In this case, S = {1, . . . ,q} for some integer q > 2 
and U(a,b) = 1 — 2Ir a= f J \. Again m = —1, and Gjj = {(n, n) : 1 < n < q}. Theorem 
S.16| is obviously applicable, and we recover the result that for sufficiently large (5 there 
exist q mutually singular, automorphism invariant Gibbs measures, the nth of which 
has an infinite cluster of spins with value n. 



Example 8.19 The hard-core lattice gas. This model has state space S = {0, 1} and 
neighbor interaction U of the form U(a, b) = 00 if a = b = 1, and U(a, b) = —^f-( a + b) 
for all other (a, b) S S 2 . Here A > is an activity parameter, and we have again used 
the transformation (|6l|) . For A > 1 we have Gjj = {(0, 1), (1,0)}, so that Theorem 8.16 



applies. Since multiplying U with a factor f3 amounts to changing A, we obtain that 
for sufficiently large A there exist two distinct Gibbs measures with infinite clusters of 
chessboard type, just as for the Ising antiferromagnet at low temperatures. 



Example 8.20 The Widom-Rowlinson lattice model. Here we have S = { — 1,0, 1} and 

log A 
2d 



U(a,b) = 00 if ab = -1, U(a,b) = -*f£(|a| + |6|) otherwise, a, b G S. If A > 1 then 



Gjj = {(— 1, — 1), (1, 1)}. Theorem 8.16 thus shows that for sufficiently large A there 
exist two translation invariant Gibbs measures having infinite clusters of plus- resp. 
minus-particles . 

Although the results in the examples above are weaker than those obtained by the 
random-cluster methods of Section ^ (when these apply) , the ideas presented here have 
the advantage of providing a general picture of the geometric mechanisms that imply a 



phase transition, and Theorem 3.16 can quite easily be applied. Moreover, the ideas can 
be extended immediately to systems with arbitrary state space and suitable interactions. 
In this way one obtains phase transitions in anisotropic plane rotor models, classical 
Heisenberg ferromagnets or antiferromagnets, and related iV-vector models; see Chapter 



16 of [96]. One can also consider next-nearest neighbor interactions, and thus obtain 
various other interesting examples; for this one has to consider percolation of ground- 
energy plaquettes rather than ground-energy bonds, which is in fact the set-up of fl96"|. 



Last but not least, the symmetry assumption of Theorem |8.16| can often be replaced by 
either some direct argument, or a Peierls condition in the spirit of the Pirogov-Sinai 
theory; see Chapter 19 of pH]. One such extension will be used in our next example. 



Example 8.21 First-order phase transition in the Potts model. Consider again the 
Potts model of Example |8.18| , and suppose for simplicity that d = 2. Any translate 
in Z 2 of the quadratic cell {0, l} 2 is called a plaquette. For a given configuration 
a G f2, a plaquette P is called ordered if all spins in P agree, disordered if no two 
adjacent spins in P agree, and pure if one of these two cases occurs. If q (the number of 
distinct spin values) is large enough then, for arbitrary (3, there exists an automorphism 
invariant Gibbs measure \i supported on configurations with a unique infinite cluster of 
pure plaquettes. This variant of Theorem |3.14 is due to Kotecky and Shlosman |14S 



see also Section 19.3.2 of [96]. Clearly, each cluster of pure plaquettes only contains 
plaquettes of the same type, either ordered or disordered. For some specific critical 
value f3 c {q) both possibilities must occur with positive probability; this follows from 
thermodynamic considerations, namely by convexity of the free energy as a function of 
(3 |148| , |96| ] . Conditioning on each of these two cases yields two mutually singular Gibbs 
measures with an infinite cluster of ordered resp. disordered plaquettes. Furthermore, all 
spins of a cluster of ordered plaquettes must have the same value, so that by symmetry 
the "ordered" Gibbs measure can be decomposed further into q Gibbs measures with 
infinite clusters of constant spin value. As a result, for large q and (3 = (3 c {q) there 
exist q + 1 mutually singular Gibbs measures which behave qualitatively similar to the 
disordered phase for [3 < (3 c (q) resp. the q ordered phases for f3 > f3 c (q). This is the 
first-order phase transition in the Potts model for large q. For further discussions we 



refer to 227 , 148 , 150 , |149| and the references therein. 



We now give an outline of the proof of Theorem |8.14| . 

Sketch proof of Theorem |8.14j : For simplicity we stick to the case d = 2. For 
any inverse temperature (3 > and any square box A n = [— n, n — l] 2 D Z 2 we write 
f^n f° r ^ ne Gibbs distribution relative to (3H in the box A n with periodic boundary 
condition. The latter means that A n is viewed as a torus, so that (i,n — 1) ~ (i, —n) 
and (n — l,i) ~ (— n, i) for i S [— n, n — 1] n Z; the Hamiltonian H per in A n with periodic 
boundary condition is then defined in the natural way. Let fj, per be an arbitrary limit 
point of the sequence {l^g r S)n>i- Evidently, pP er has the symmetry properties required 



of fi in Theorem and \x ver G G{(3H). 

To establish percolation of ground-energy bonds we fix some a < 1 and consider the 
wedge W = {x = (xi.,22) E C : xi > 0, \x2\ < ax{\. Let Ay\> be the event that there 
exists an infinite path of ground-energy bonds in W starting from the origin. We want 
to show that /i per (Ayy) > 3/4 when (3 is large enough. Suppose £ £" Ayy. Then there 
exists a contour crossing W, i.e., a path 7 in the dual lattice C* = 7? + (i, |) which 
crosses no ground-energy bond for £ and connects the two half-lines bordering W. For 
each such path 7 we will establish the contour estimate 

/i per ( 7 is a contour) < (l^le"^) 171 , (64) 

where I7I is the length (the number of vertices) of 7, and 5 > is such that m + 25 is 
the second lowest value of U. 

Assuming (64) we obtain the theorem as follows. The number of paths of length k 
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crossing W is at most ck 3 for some c < oo depending on a. Hence 

» per (A^)<cJ2HZ\S\e- p5 ) k <\ 

k>l 

for sufficiently large (3. By the rotation invariance of fi per , it follows that ^ per {Ao) > 0, 
where Aq is the intersection of Ayy with its three counterparts obtained by lattice 
rotations. Roughly speaking, Aq is the event that the origin belongs to two doubly 
infinite ground-energy paths, one being quasi-horizontal and the other quasi-vertical. 
Since fj, per is invariant under horizontal and vertical translations, the Poincare recurrence 
theorem (or the ergodic theorem) implies that the event 

^oo = {£, £ ^ : 8xi £ Aq for infinitely many x in each of the four half-axes} 

has also positive ^ per -probability. Each configuration in A^ has infinitely many quasi- 
horizontal and quasi-vertical ground-energy paths in each of the four directions of the 
compass, and by planarity all paths of different orientation must intersect. Therefore 
all these paths belong to the same infinite ground-energy cluster which has only finite 
holes, and is therefore unique. Hence A^ is contained in the event B that there exists 
a ground-energy cluster surrounding each finite set of C, and n per (B) > 0. As B is 
a tail-event and invariant under all automorphisms of C and all symmetries of U, the 
theorem follows by setting /x = /x per ( • \ B). 

It remains to establish the contour estimate (|64|). For this it is sufficient to show 
that 

/i^( 7 is a contour) < (^le"' 35 ) 171 (65) 

when n is so large that 7 is contained in A n . This bound is based on reflection positivity 
and the chessboard estimate, which are treated at length in Chapter 17 of p6| . Here 
we give only the principal ideas. The basic observation is the following consequence of 
the toroidal symmetry of ^ n : for any % £ {0, . . . , n— 1}, the configurations on the two 
parts = {x 6 A n : x\ > i or x\ < i — n} and A~ { = {x G A n : % — n < x\ < i} of 
A n are conditionally independent and, up to reflection, identically distributed given the 
spin values on the two separating lines {x\ = 1} and {x\ = i — n}. Hence, if /, g are real 
functions on S An depending only on the configuration in A^ i; and g® is the function 
obtained from g by reflection in these two separating lines (and thus depending on the 
configuration in A~ { ), then the bilinear form (/, g) — * l^pnifd ) ls positive definite and 
thus satisfies the Cauchy-Schwarz inequality. Similar Cauchy-Schwarz inequalities hold 
for vertical reflections. The chessboard inequality is obtained by suitable combinations 
of all these, as we will illustrate next. 

Let us mark the plaquettes around the vertices of 7 with a •; this gives I7I marked 
plaquettes. Marking a plaquette indicates that at least one of its bonds has non-minimal 
energy; leaving it unmarked does not say that it consists of ground-energy bonds, but 
that we don't need any information on its spins . In the case n = 2, this might lead to 
the picture 
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To estimate its probability we use repeatedly the Cauchy-Schwarz inequality relative to 
suitable pairs of reflection lines. Indicating each time only the pair of lines used next, 
and omitting the event that no plaquette is marked (which has probability 1), we obtain 



per 



< 



per 



1/2 



( • • • • 



< 



per 



1/4 



per 



per 
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per 



3/16 



In general, we obtain in this way 

is a contour) <^(C n )H/l A "l, 

where C n is the event that all plaquettes in A n contain at least one bond of non-minimal 
energy. But if C n occurs then at least |A n |/2 of the 2|A n | edges in A n are no ground- 
energy bonds. The Hamiltonian H^ r with periodic boundary condition is therefore at 
least (2m + 5)|A n | on C n . Since there is at least one a G S n with Hf L er {a) = 2|A n |m, 
it follows that 



This gives estimate (65) and completes the proof of Theorem 3.14 



□ 
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9 Random interactions 



So far in this review, the spin systems considered had an interaction which was invariant 
under all automorphisms of the underlying graph C Here we will assume for convenience 
that C is the cubic lattice (Z d ,B), d > 2, but the interaction between adjacent spins 
will no longer be translation invariant. That is, instead of the Hamiltonian (|l]) we now 
consider a modified Hamiltonian of the form 

H(a)= Yl JbU(a(x),a(y))+J2h x V(a(x)) (66) 

b={xy)eB xGZ d 

where the Jb and the h x may vary from bond to bond, resp. from site to site. In fact, we 
are interested in the case where these coupling coefficients show no regular structure, 
and thus assume that they are random. Such systems of spins interacting differently 
depending on their position and in a way governed by chance are known as disordered 
systems. We will not elaborate on the physical origins of such random interactions. We 
merely mention that they can be related to the presence of impurities or defects in an 
originally homogeneous system, and are used to model quenched alloys of magnetic and 



nonmagnetic materials like FeAu. For details we refer to (84|, 30, [fq| . 

We assume that the family J = (Jb)b&B of coupling coefficients and the external 
fields h = (h x ) xeZ d are independent, and each collection constitutes a family of mu- 
tually independent and identically distributed real random variables. Hence, while no 
realization of the coupling coefficients is translation invariant, we still have translation 
invariance in a statistical sense. We will not specify the underlying probability space, 
except that the letter P will be used to denote the probability measure and the asso- 
ciated expectation. The random families J and h are often referred to as the disorder. 
The disorder is called bounded if > c) = for some finite c. Physically, this is 

the most relevant case. 

(In some physical applications it is natural to assume that the Jb are not independent 
but rather have some finite-range dependence structure, but we will not include this case 
here. We also assumed for simplicity that the disorder is real valued, although some of 
the following also applies to the case when Jb or h x are allowed to take the value +oo 
with positive probability.) 

In Section |9.1| we will discuss diluted ferromagnets. A bond-diluted Ising or Potts 
ferromagnet on Z rf can be viewed as an Ising or Potts model on the open clusters for 
Bernoulli bond percolation on Z d . As observed in ||, these models can quite easily be 
understood using their random-cluster representation. They form just about the only 
class of disordered systems where the phase transition can be investigated in such detail. 



Then, in Section 9.2, we study the so-called Griffiths regime which is the only non- 
trivial regime for disordered systems where by now quite general results are available. 
It occurs at intermediate temperatures if the disorder is bounded, or at arbitrary high 
temperatures if the disorder is unbounded, and is characterized by the fact that the 
Gibbs measure is still unique but fails to have a nice high temperature behavior uni- 
formly in the disorder. The study of random Gibbs measures in the Griffiths regime 
has started in the early 1980's and has reached a satisfactory stage only recently. The 
simplest and also most powerful methods use stochastic-geometric representations and 
will be presented here. 



As representative for the large literature on the subject we refer to [13, 18, 29, 46 



72| , |84| , |85| , |86| , |l02j , |106| , |l89j , |190J. Dynamical problems (which are not touched upon 
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here) are treated e.g. in E|, M, M, [l5], ESS, jETl PHl . 



9.1 Diluted and random Ising and Potts ferromagnets 

The random Potts model is defined as follows. Spins take values in the state space 
S = { 1 , . . . , q} , and the interaction is given by the Hamiltonian ( |66| ) with 

U(a(x),a(y)) = 2 IfrMfrb)} 



and V = 0. Note that this choice of C7 and V coincides with that used in Section 6.1 for 
the standard Potts model and differs from that in Section |3.3| only by constants which 
cancel in the definition of Gibbs distributions. The Ising model corresponds to the choice 
q = 2. As for the disorder, we make the essential assumption that the random coupling 
coefficients Jf, are nonnegative, so that the interaction is still ferromagnetic. Of course, 
we also make the general assumption of this section that the are independent with 
the same distribution, say tt. A particular case of special interest is that of dilution, in 
which the Jj, take the values 1 and with probabilities p and 1 — p, respectively, which 
means that tt = p5\ + (1 — p)b~o- For p = 1 we then recover the homogeneous Potts 
model of Section p.3f 

In the following, the distribution tt of the will enter only through the quantities 

P(M = *(i - ^ Jb ) » pSM = p {tT0T^) 

for [3 > 0. Note that they do not depend on b £ £>, and that < p(/3, tt) < p((3, tt) < p 
with p = P(J b > 0). 

For a given realization J = (Jb)b&B °f the disorder and inverse temperature (3, we can 
introduce the Gibbs measure /i^j obtained from the Gibbs distributions with constant 
boundary condition i £ {1, . . . ,q} in the infinite volume limit. This limit exists by the 
arguments of Proposition |6.9| , since these use only the stochastic monotonicity coming 
from Corollary |6.7| (a) as well as the random-cluster representation, which both remain 
valid in the non-homogeneous case. 

The key quantity for phase transition, the order parameter, is the "quenched mag- 
netization" 



Indeed, an inhomogeneous version of equation Q32]) shows that m((3, tt) = P(8 q ((3J)), 
where 6 q ((3J) = 0^(0 «-> oo) is the percolation probability for the wired infinite- 
volume random-cluster measure with bond probabilities p^ = 1 — e~ 2/3Jb . Hence, we 
have m(/3, tt) > if and only if 6 q {j3i) > with positive P-probability. But whether or 
not 6q(f33) > does not depend on the value of for a single bond b. So, Kolmogorov's 
zero-one law implies that m((3,ir) > if and only if 9 q (f33) > P-almost surely, and 
by an inhomogeneous version of Theorem |6.10| the latter means that multiple Gibbs 
measures for [33 exist with P-probability 1. Moreover, an inhomogeneous version of 
relation ( |3l|) shows that 6 q {[33) is an increasing function of [33. It follows that m(/3, tt) 
is increasing in (3 and also in tt (relative to d^T>)- in particular, for each tt there exists a 
critical inverse temperature [3 c (tt), possibly = +oo, such that m([3, tt) > for [3 > [3 c {tt) 
and m(/3,7r) = for [3 < (3 c (tt). 

It remains to investigate the quenched magnetization m((3, tt). The following lemma 
shows how m(/3, tt) can be estimated in terms of Bernoulli bond percolation; recall the 
end of Section 15.11. 
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Lemma 9.1 Let 6(p) = 4> p (0 «-> oo) be the Bernoulli bond percolation probability on Z d 
with parameter p. Then 

e(p(p,ir))<m(p,ir)<e(p(p,ir)). 

Proof: We will use an inhomogeneous limiting version of the domination bounds (b) 
and (c) of Corollary |T7]. Although they were stated only in the case of a homogeneous 
interaction, they do extend also to the inhomogeneous case. Define two families p = 
(pb)b&B an d p' = (Pb)beB m terms of a realization J by p& = 1 — e~ 2 ^ Jb , p' b = (1 — 
e~ 2 P Jb )/(l + (q - l)e~ 2/3Jb ) = Pb/(pb + <?(! - Pb))- Let <f> pq be the associated wired 
random-cluster measure, and <p p , 4>p' the corresponding product measures on {0, 1} . 
An inhomogeneous version of Corollary |6.7| then shows that 



<M° ~ oo) < 0p, g (O «-> oo) < p (O «-> oo) . 

We now take the expectation with respect to P. In view of the preceding remarks, the 
middle term has P-expectation m(/3,7r), while the P-integration of the Bernoulli mea- 
sures (f) p i and (j) p again leads to Bernoulli measures, namely the homogeneous Bernoulli 
measures 4> p (i3,tt) an d <f>p(B,%)' The lemma follows immediately. □ 

Combining the lemma with the discussion before we arrive at the following result on 
phase transition in the random Potts model. 

Theorem 9.2 Consider the random Potts model on Z d at inverse temperature (3 > 
with coupling distribution tt. Set p(ir) = 7r(]0, oo[) = P(Jb > 0), and let p c be the 
Bernoulli bond percolation threshold of Zi d ; so p c = 1/2 when d = 2. 

(i) Ifp(f3,ir) < p c then with P -probability 1 there exists only one Gibbs measure with 
interaction (3 J. In particular, this holds when p(ir) < p c or (3 is small enough. 

(ii) If 7r) > p c then m(/3, 7r) > 0, and with P -probability 1 there exist q distinct 
phases for the interaction (33. In particular, this holds when p(ir) > p c and (3 is 
large enough. 

Another way of stating this result is the following. Suppose tt = (1 — p)So +P7I+ with 
7r + = 7r(- |]0, oo[), and let L + ((3) = J °° e~ l3t 7r + (dt) be the Laplace transform of 
(Note that then p((3,ir) = p(l - L + (2(3)) and p(/3,vr) > p(l - qL + {2(3)).) Then there 
is no phase transition for p < p c , whereas for p > p c the critical inverse temperature 
f3 c (p, = /3 c (7r) is finite (and decreasing in p) and satifies the bounds 

P-Pc s r (not k^P-Pc 
< L + (2/3 c (p,7T + < . 

pq p 

If 0(p c ) = (which is known to hold for d = 2, and is expected to hold for all dimensions) 
then uniqueness holds when p = p c or (3 = (3 c (p, 7r +)- In physical terminology, the 
preceding bounds on (3 c (p,^+) imply that the so-called crossover exponent is 1. 

Example 9.3 The case of dilution. If Jb is 1 or with probability p resp. 1 — p then 
L + {[3) = e - ^. Hence, for p > p c the critical inverse temperature satisfies the logarithmic 
bounds 

-ln^^<2/3 c (p,<5 1 )<-ln^^. 
p pq 

For q = 2, the diluted Ising model, assertion (ii) of Theorem gives the slightly 
sharper upper bound {3 c (P,$l) — tanh _1 (p c /p). 
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Example 9.4 The case of power law singularities. Suppose ir + (dt) = T(a)~ 1 t a ~ 1 e~ t dt 
is the Gamma distribution with parameter a > 0. Then L + (j3) = (1 + f3)~ a , so that for 
p > p c the critical inverse temperature satisfies a power law with exponent — l/o: 

„ \ l/o / n n \ l / a 

"' » 1 < 2/3 c (p,vr + ) < / 



P-PcJ \P-Pc 



Examples with other kinds of singularities can easily be produced [94]. 



Theorem 9.2 is due to |j. Earlier, a generalized Peierls argument was used in [93| 



to show that for the diluted Ising model (g = 2) in d = 2 dimensions a phase transition 
occurs almost surely when p > p c = 1/2 and j3 is large enough. In fact, this paper dealt 
mainly with the case of site dilution, in which sites rather than bonds are randomly 
removed from the lattice, and which in the present framework can be described by 
setting J( xy ) = £(a>)£(y) for a family (£{x)) xeZ d °f Bernoulli variables; the are thus 
1-dependent. This was continued in |)4|, for a class of random interaction models 
including the random-bond Ising model as considered here, obtaining improved bounds 
on p c {p, 7r_|_) for d = 2 as p [ 1/2. Extensions, in particular to d > 3, were obtained in 
fl53|| . The diluted Ising model with a non-random external field h ^ does not exhibit 
a phase transition; this was shown in HI for £ = 2i d and recently extended to quite 



general graphs in [ 123 1 



For the diluted Ising model there is also a dynamical phase transition at the point 
p = p c . For p > p c and (3 c (l,6i) < (3 < (3 c (p,Si) the relaxation to equilibrium is no 
longer exponentially fast [jD]]. This illustrates that uniqueness of the Gibbs measure 
does not in itself imply the absence of a critical phenomenon. Beside such dynamical 
phenomena, there are also some static effects of the disorder in the uniqueness regime, 
albeit these are perhaps less remarkable. These are the subject of the next subsection. 

9.2 Mixing properties in the Griffiths regime 

As we have seen above, the diluted Ising ferromagnet shows spontaneous magnetization 
when p > p c and (3 > (3 c (p) = (3 c (p,5i), and multiple Gibbs measures for /3J exist 
almost surely. In the uniqueness region when still p > p c but (3 < f3 c (p) we need to 
distinguish between two different regimes. At high temperatures when actually (3 < f3 c = 
/3 C (1), the critical inverse temperature of the undiluted system, we are in the so-called 
paramagnetic case. This is comparable to the usual uniqueness regime for translation 
invariant Ising models. At intermediate temperatures, namely when f3 c < (3 < /3 c (p), 
we encounter different behavior arising from the fact that the system starts to feel the 



disorder. This regime is called the Griffiths regime, since it was he 105 ] who discovered 
in this parameter region the phenomenon now called Griffiths' singularities. He studied 
site-diluted ferromagnets , but the arguments remain valid also in the bond-diluted case. 
The basic fact is the following: adding a complex magnetic field h to the Hamiltonian of 
the diluted Ising model we find that the partition function in a box with plus boundary 
conditions, as a function of h, can take values arbitrarily close to zero. The reason is 
that typically a large part of the box is filled by a huge cluster of interacting bonds, 
giving a contribution corresponding to an Ising partition function in the phase transition 
region. The radius of analyticity of the free energy around h = is thus zero. In other 
words, the magnetization m(j3,p,h) cannot be continued analytically from h > to 
h < through h = when p > p c and (3 > j3 c . So, the presence of macrosopic clusters 
of strongly interacting spins (on which the spins show the low temperature behavior of 
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the corresponding translation invariant system) gives rise to singular behavior. Related 
phenomena show up in a large variety of other random models, though not necessarily 
in the form of non-analyticity in the uniqueness regime; in general it may be difficult to 
pinpoint their precise nature. Nevertheless, we will speak of Griffiths' phase or Griffiths' 
regime whenever such singularities are expected to occur, even when a proof is still 
lacking. These terms then simply indicate that the usual high temperature techniques 
cannot be applied as such. 

As another illustration we consider a random Ising model with unbounded, say 
Gaussian coupling variables Jb- Then /3J& is also unbounded, even for arbitrarily small (3, 
and with high probability a large box contains a positive fraction of strongly interacting 
spins. In particular, there is no paramagnetic regime, and the whole uniqueness region 
belongs to the Griffiths phase. For this reason, it is a non-trivial problem to show the 
uniqueness of the Gibbs measure. For example, the standard Dobrushin uniqueness 
condition encountered in (jiq) (cf. [65, 70 1) is useless in this case; similarly, a naive use 



of standard cluster expansion techniques fails. In fact, these methods are bound to fail 
since they also imply analyticity which is probably too much to hope for (even though 
we cannot disprove it). 

In the following we will not deal with the singular behavior in the Griffiths phase. 
Instead, we address the problem of showing nice behavior, which we specify here as 
good mixing properties of the system. We shall present two techniques: the use of 
random-cluster representations, and the use of disagreement percolation. 

Application of random- cluster representations. Consider a random Ising model. Spins 
take values o~(x) = ±1, and the formal Hamiltonian is 

H(a) = - Yl Jb o-(x)o{y) . (67) 

b=(xy)&B 

We set /3 = 1. Let //} A be the associated Gibbs distribution in A £ £ with boundary 
condition r\ € £1. For many applications it is important to have good estimates on the 
variational distance || • ||a on A C A (see ((45|)) of these measures with different boundary 
conditions. 

Definition 9.5 The random spin system above is called exponentially weak-mixing 

with rate m > if for some C < oo and all A £ £ and A C A 

P( max ||^ A - < C\A\ e ~ md ^ , (68) 

\ 77,77' en j 

where d(A, A c ) is the Euclidean distance of A and A c . 

Various other mixing conditions can also be considered. A stronger condition requires 
that the variational distance in ( |68] ) is exponentially small in the distance between the 
set A and the region where the boundary conditions r/ and rj' really differ. One could 
also restrict A and/or A to regular boxes. See [17C, 217, 70, 71, 25, E3, 5C, 15j. 



Let us comment on the significance of the exponential weak-mixing condition above. 



Remarks: (1) Suppose condition ( p8|) holds. A straightforward application of the 
Borel-Cantelli lemma then shows that for any m! < m 

m ,a? n ||Mj,a - Mj'aIU < Cj \A\ e 
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with some realization-dependent Cj < oo P-almost surely. Integrating over rf for any 
Gibbs measure fj,j we find in particular that = lim A -|> Z d A for all rj, implying that 
Hj is the only Gibbs measure (and depends measurably on J). Moreover, noting that 
fj,j(A\B) = J fij A (A) fj,j(dr]\B) for A £ Tt± and B £ Fa^, we see that this realization- 
dependent Gibbs measure fij satisfies the exponential weak-mixing condition 

sup \m(A\B) -fJL 3 (A) | < C 3 \A\e- m ' d( - A ^ . (69) 

Ae^A , BeF A c , fij (B)>o 



(2) Condition ( |69| ) above also implies an exponential decay of covariances. Let / 
be any local observable with dependence set A G £ and g be any bounded observable 
depending only on the spins off A, where A C A. Also, let 5(f) = max ff ^/ \f(o~) — f(&')\ 
be the total oscillation of / and 5(g) that of g. The covariance g) of / and g then 

satisfies P-almost surely the inequality 

M/; g)\ < Cj | A| 5(f) 5(g) e~ m ' d ^ /2 . 

Indeed, a short computation shows that h(/; is not larger than the left-hand 
side of inequality (]69|) times 5(f) 5(g)/2, cf. inequality (8.33) in |Q. If ^ is local, a 
similar inequality holds for covariances relative to finite volume Gibbs distributions in 
sufficiently large regions with arbitrary boundary conditions. 

We will now investigate under which conditions the random Ising system with Hamil- 
tonian (|67[) is exponentially weak-mixing. We start from the estimate 



II4a-^',aIIa<< 2 ,a(A-5A) 

obtained in Theorem As before, this bound is also valid in the inhomogeneous case 
considered here, and (ftp 2 A st an ds for the wired random-cluster distribution in A with 
bond-probabilities p = p(| J|) = (pb)beB given by pb = 1 — exp[— 2| Jb\]. Next we can use 
a recent concavity result of [|l5|]: 

Lemma 9.6 Let K = (Kb)b<=B be a collection of positive real numbers and p = p(K) 
denote the collection of densities pb = 1 — e~ 2Kb . For any increasing function f , the 
expectation (fi p 2 A (/) is then a concave function of each Kb- 

Proof: For brevity we set 4>\, 2 A = 4>- For any fixed bond b we consider the functions 
F(pb) = 4>(f) and G(Kb) = F(\ — e~ 2Kb ). Using equation ( ]27| ) of Lemma 6.6 we then 
find that 

F'(Pb)=Hf9p b )-Hf)H9 Pb ) 



and 

where for % £ {0, 1} 
This implies 



F"( Pb ) = -2F'( Pb )4>(g Pb ) , 
277 b -l 



pT(i-PbY- 



G"(K b ) = -Ae- 2Kb F'(p b ) [1 + 2e~ 2K ^(g Pb )]. 

Now, F' is nonnegative because / and g Pb are increasing and 4> has positive correlations 
by Corollary |6.7| (a). Another explicit computation shows that 

1 + 2e~ 2K *4>(g Pb ) = 2cP(rj b = l)/p b - 1 > 2/(2 - Pb ) - 1 > 0, 
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where the first inequality uses the fact that the random-cluster distribution for q = 
2 dominates an independent percolation model with densities p' b = Pb/(2 — pb), see 



Corollary 6.7(c). It follows that G" < 0, proving the claimed concavity. □ 



The preceding lemma implies that 

^(Vp(|j|),2,a( A <-» dA)) < < 2jA (A <-» dA) < J2 <PI,2,a( x " 5A )> 

where p = 1 — exp[— 2P(\ J&|]. The weak mixing property (68) thus follows provided 
we can show an exponential decay of connectivity in the wired random-cluster distri- 
bution 0p 2 ,A- The latter certainly holds when p < p c because 0p 2 A ^ s dominated (on 
A) by the Bernoulli measure (f) p (cf. Corollary |6.7| (b)), and the connectivity of a sub- 



critical Bernoulli model decays exponentially fast (recall Theorem So we arrive in 
particular at the following result. 

Theorem 9.7 Consider a random Ising model on Zi d with Hamiltonian (61). If 
2P(\Jb\) < — ln(l — p c ) for the Bernoulli bond percolation threshold p c of Z d then the 
system is exponentially weak-mixing. 



As the proof above shows, the factor |A| on the right-hand side of (|68| ) can actually be 
replaced by \dA\. 

In the diluted Ising model at inverse temperature (3 (see Example |9.3|) , the condition 
of the preceding theorem reads 2f3p < — ln(l — p c ). Therefore, if d is so large that 
2(3 c Pc < — hx(l — p c ) then the theorem covers part of the Griffiths regime. This fact is 
evident in the case of an unbounded, say Gaussian, disorder because then (as explained 
above) there is no paramagnetic phase. 

Exponential weak-mixing for random Ising models can also be shown by other ap- 
plications of random-cluster domination. Let us sketch such an alternative route. Using 
the random-cluster representation, Newman |182fl showed that (pointwise in the disorder 
J) 

IIm^a-M^aIIa<2^/x+ |iA (X(x)), (70) 
x-eA 



max 

n,ri' 



where /ijt, . is the Gibbs distribution in A with plus boundary conditions for the Hamil- 



tonian ( p7[) with Jft replaced by \Jb\- On the other hand, Higuchi [132] obtained the 
estimate 

mJi^W) < E E ii lA (x( X )x(z)) 

y£A zeA: z^y 

(the superscript '/' referring to the free boundary condition), while Olivieri, Perez and 



Goulart Rosa |185| proved that 

P^ JlA (X(x)X(z))^ < fj, f J>A (X(x)X(z)) (71) 

with J = P(\Jb\). We are thus back to the standard Ising Gibbs distribution in A with 
zero external field, free boundary condition and coupling constant J. Now we can take 
advantage of the second Griffiths inequality (which we did not discuss so far in this text) 
stating that correlation functions such as on the right-hand side above are monotone in 
the coupling coefficients, see e.g. [16C ] . This implies that the right-hand side of ( |7l| ) is 



an increasing function of A and thus bounded above by its infinite volume limit. But for 
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J less than J c , the critical coupling, the Gibbs measure is unique and has an exponential 
decay of correlations, see Q. We thus find that for J < J c the right-hand side of (|7l|) 
has an exponential upper bound Cexp[— m\x — z\] for suitable constants C < oo and 
m > 0. Together with the previous estimates, we conclude that under the condition 
P(\Jb\) < J c , the random Ising model on 7i d with Hamiltonian ffilQ is exponentially 
weak-mixing. Again, this condition covers part of the Griffiths regime for the diluted 
Ising ferromagnet. 

As an alternative to the use of Griffiths' inequalities above we can also apply Theo- 



rem 3.2 and Corollary |6.7| (b), giving 

'li{ JltA (X(x)X(z))) < 



where <j) p is the bond Bernoulli measure with density p = P(l — exp[— 2| Jb\])- As 
mentioned above, its connectivity function decays exponentially fast when p < p c . We 
therefore conclude that the random Ising model on Z d is also exponentially weak-mixing 
when P(l — exp[— 2| Jf,|]) < p c . 

The above estimate ( |70"| ) does not hold when we add a random magnetic field to the 
Hamiltonian (|67|), 

H(°) = - E JbO-(x)a(y)- £ h x a(x), (72) 

b=(xy)eB x£Z d 

with i.i.d. real random variables h x independent from the J&. However, if Jb,h x > 
then we can take advantage of Section 2 of |132| ] to replace ([7(]) with 

ma f ll/*J,h,A - Mj'.h.AllA < 2 ^ fJ,+ A (X(x)), 
and we can continue as above. 

Application of disagreement percolation. As we have indicated at the end of Section 



7.1[ the idea of disagreement percolation can be applied to study the Griffiths regime 
for rather general random-interaction systems. To be specific we consider Ising spins 
with the Hamiltonian ( |72| ) . We consider the finite volume Gibbs distribution h A in 
a box A with boundary condition w ; as before, the subscripts J, h describe the random 



interaction. We are going to apply Theorems 7.1 and \T2 pointwise in the disorder. As 



in ([44|) we thus have to consider the variational single-spin oscillations 

Px h = max ll^j,h,z - Mj,h,JU (73) 

77,77 £S 2 ' ' 

which depend on the disorder J,h. Under the present assumptions, (pi ,h ) x& z d ^ s a 
1-dependent random field; this is the only property of the disorder needed below. Now, 



from Theorems 7.1 and 7.2 we can conclude that if with P-probability 1 there is no 



Bernoulli site-percolation with densities (|73| ) then, again with P-probability 1, there is 



a unique Gibbs measure for J,h. (In fact, similarly to the results of Section 7.2 the 
random system can be dominated by a random percolation system which more or less 
coincides with a stochastic-geometric representation of the diluted ferromagnet |101|].) 



The following theorem is an immediate consequence of the results of Section 7.1. 
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Theorem 9.8 An Ising system with random Hamiltonian ^7^) satisfying 



P(p 3 ' h ) < _ 

{Px j (2d -I) 2 



is exponentially weak-mixing. 



Of course, we now have to add in (^) the subscript h referring to the random external 
field. 



Proof: Consider Theorem 7.1. Evidently, the right hand side of can only increase 
if the density for an open site is set to be 1 on the odd sublattice £ dd of Z°\ Due to 
the 1-dependence of the random field p = (Px ,h ) x ez d noticed above, the remaining p^. ,h 
with x S C even are mutually independent. Taking the P-expectation in ( |46D we thus 
obtain on the right-hand side the Bernoulli percolation probability ip Pt even 
where ip Pt even is the Bernoulli measure with density p = P(p^' il ) on the even sublattice 
Ceven an d density 1 on C dd- The exponential weak-mixing property now follows by an 
argument similar to that following (18); note that, relative to ipp,eveni a path of length 
k is open with probability at most pl^/ 2 J . □ 

Remarks: (1) Suppose we add a uniform magnetic field h to the random Hamiltonian 
(|72|). Under the conditions of Theorem |9.8| it is then not too difficult to show that 
the disorder-averaged expectation P(f^j,h+h(f)) of any local function / is an infinitely 
differentiable function of h, see e.g. fl72|. 



(2) Following Theorem 7.2, Theorem |9.8| requires that the single-point densities (|73|) 
are globally small enough to prevent percolation in the associated Bernoulli model. This 
condition can be extended to so-called constructive conditions involving finite boxes 
rather than single sites p5[. 



(3) Another very powerful and transparent treatment of the Griffiths regime (based 
on very similar percolation ideas) has been developed in [72]. This paper proposes 



a technique similar to [18] and |85| of a resummation in the high temperature cluster 
expansion. The bounds then allow a probabilistic interpretation of the expansion linking 
it with a bond percolation process. 
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10 Continuum models 



All models considered so far lived on a lattice. Physical systems like real gases, however, 
are more realistically modelled by particles living in continuous space. This section is 
an outline of how some of the stochastic-geometric ideas developed in previous sections 
can be applied to a continuum setting. First, in Section 10.1, we consider the natural 
continuum analogues, based on Poisson processes, of the Bernoulli percolation models 
introduced in Section ||. Then, in Section 10.2, we consider a continuum variant of 
the Widom-Rowlinson model introduced in Section 3J3 and discuss its phase transition 
behavior. (As mentioned in Section 10.2, this continuum variant is the one originally 
considered by Widom and Rowlinson [223], so it predates the lattice model.) 



10.1 Continuum percolation 

Here we consider the basic models of continuum percolation. For a thorough treatment 
of the mathematical theory of continuum percolation we refer to Meester and Roy |173j ] . 

We first need to introduce the Poisson process on H d and its subsets. Heuristically, 
a Poisson process with intensity A > on R d is a random set X of points in R rf with 
the properties that 

(i) for any bounded Borel set A of R d with volume |A|, the number of points of X in 
A is Poisson distributed with mean A|A|, i.e., for n = 0, 1,2, . . . the probability of 
seeing exactly n points in A equals exp(— A|A|)(A|A|) n /ra! ; 

(ii) for any two disjoint such subsets Ai and A2, the numbers of points observed in Ai 
and in A2 are independent. 

For a construction of such a process, we first consider a bounded Borel set A of R d . Let 
f^A be the set of all finite subsets of A. A Poisson process on A with intensity A > is 
then given by a random element of Vt\ having distribution 7Ta,a> where 

00 A ra f f 

k\,a{F) = e" A|A| j ■■■ j J {{*'i,...,z„}eF} dxi ■ ■ ■ dx n 

n=0 

for all F G the smallest cr-field which allows us to count the number of points in 
each Borel subset of A. 

Next, let O be the set of all locally finite point configurations on R d ; locally finite 
means that any bounded set contains only finitely many points. The Poisson process 
on R^ with intensity A is a random point configuration X distributed according to the 
unique probability measure tt\ on O, which, when projected on any bounded Borel set 
A C R d , yields tt\,a- Properties (i) and (ii) above are easily verified, and make Poisson 
processes the natural analogues of Bernoulli measures for lattice models. 

To study percolation properties of the Poisson process X, we need to introduce some 
notion of connectivity. A natural way is to imagine a closed Euclidean ball B(x,R) of 
fixed radius R around each point x of the Poisson process, which leads us to considering 
the random subset X = {J x& x B{x,R) of R rf . Such random subsets are widely known as 
Boolean models. (More generally, B(x,R) could be replaced by a closed compact random 
set centered at x.) This particular Boolean model is often referred to as the Poisson 
blob model or lily pond model. Two points x, y £ X are then considered as connected 
to each other if they are connected in X, meaning that there exists a continuous path 
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from x to y in X. By scaling, there is no loss of generality in setting R = 1/2, so that 
two balls centered at x and y intersect if and only if \x — y\ < 1, where | • | denotes 
Euclidean distance. The basic result on Boolean continuum percolation, analogous to 



Theorem 5.3 for ordinary site percolation, is the following. 



Theorem 10.1 Pick a Poisson process X on H d , d > 2, with intensity X. Let X = 
[J X £X -^( x > 1/2) be the associated Boolean model, and let 9(X) denote the probability that 
the origin belongs to an unbounded connected component of X. Then there exists a 
critical value A c = X c (d) £ (0, oo) such that 9(X) = if X < A c and 9(X) > if X > X c . 

The standard proof of this result is based on a partitioning of R d into small cubes, 
reducing the problem to its lattice analogue, Theorem 5J3; see 1l73fl or [108]. 



Another continuum percolation model is the so called random connection model, or 
Poisson random edge model, which was introduced by M. Penrose [ ]188[ . Let g : [0, oo) — > 
[0, 1] be a decreasing function with bounded support (that is g{x) = when x exceeds 
some R < oo). The random connection model with intensity A and connectivity function 
g arises by taking a Poisson process X in R rf with intensity A and independently drawing 
an edge between each pair of points x and y of X with probability g(\x — y\). This setting 



includes the Boolean model, which corresponds to the choice g = I\o,i]- Theorem 10.1 
extends to this model: For g as above with g(x)dx > and dimensions d>2, there 
is a critical value A c = X c (d,g) such that infinite connected components a.s. occur (resp. 
do not occur) whenever A > A c (resp. A < A c ) in the random connection model with 
intensity A and connectivity function g. 

Much of the theory of standard (lattice) percolation has analogues for these con- 
tinuum models. An example is the uniqueness of the infinite cluster (Theorem 5.171 ), 



which goes through for the Boolean and random connection models. See [173] for this 
and much more. 



10.2 The continuum Widom— Rowlinson model 

The continuum Widom-Rowlinson model is a marked point process where the points are 
of two types: we call them plus-points and minus-points. (From now on and throughout 
this section, we drop the term "continuum" when refering to this model, and instead add 
the word "lattice" when talking about the model of Section |3.5| .) For the model defined 
on a region A C Tl d realizations take values in Qa x ^a! the first coordinate describes 
the locations of the plus-points, and the second coordinate the minus-points. There is 
a hard sphere interaction preventing two points from coming within Euclidean distance 
R from each other; again, we set R = 1 without loss of generality. This interaction 
corresponds to the Hamiltonian 

#( x >y)= 00 J {k-yi<i} ) 

xGx,yey 

x,y G Q A . 

When A is bounded, the Widom-Rowlinson model on A with intensity A is obtained 
by conditioning the Poisson product measure tv\ \ x ir\ t \ on the event that there is no 
pair of points of opposite type within unit distance from each other. The extension 
to R d is done in the usual DLR fashion: a probability measure fi on x Q is a Gibbs 
measure for the Widom-Rowlinson model at intensity A if, for any bounded Borel set A, 
the conditional distribution of the point configuration on A given the point configuration 
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on H d \ A is that of two independent Poisson processes conditioned on the event that no 
point in A is placed within unit distance from a point of the opposite type, either inside 



or outside A. The resemblence with the lattice Widom-Rowlinson model of Section 3.5 



is evident. We have the following analogue of Theorem 3.4 



Theorem 10.2 For the Widom-Rowlinson model on Tl d , d > 2, with activity X, there 
exist constants < X' c < A" < oo (depending on d) such that for A < X' c the model has 
a unique Gibbs measure, while for A > X" there are multiple Gibbs measures. 

The proof of this result splits naturally into two parts: first, we need to demonstrate 
uniqueness of Gibbs measures for A sufficiently small, and secondly we need to show non- 
uniqueness for A sufficiently large. The first half can be done by a variety of techniques. 
For instance, one can partition H d into cubes of unit sidelength and apply disagreement 
percolation (Theorem |7.l| ). Two observations are crucial in order to make this work: 
that the conditional distribution of the configuration in such a cube given everything else 
only depends on the configurations in its neighboring cubes, and that the conditional 
probability of seeing no point at all in a cube tends to 1 as A — > 0, uniformly in the 
neighbors' configurations. 

The more difficult part, the nonuniqueness for large A, was first obtained by Ruelle 
[|204|| using a Peierls-type argument. Here we shall sketch a modern stochastic-geometric 
approach using a random-cluster representation. This approach is due mainly to Chayes, 
Chayes and Kotecky [^4|] (but see also |10C( |), and works in showing both parts of 
Theorem |3.4|. The so called continuum random- cluster model is defined as follows. 



Definition 10.3 The continuum random-cluster distribution (f>\,A with intensity 
X for the compact region A C is the probability measure on Q\ with density 

f( x ) = J-2 k ^, xe!! A (74) 

with respect to the Poisson process tt\a °f intensity X; here Z\ a is a normalizing con- 
stant and fc(x) is the number of connected components of the set x = \J x&x B(x, 1/2). 

In analogy to the correspondence between the lattice Widom-Rowlinson model and its 
random-cluster representation in Propositions |6.22 and 3.23, we obtain the continuum 



random-cluster model by simply disregarding the types of the points in the Widom- 
Rowlinson model, with the same choice of the parameter A. Conversely, the Widom- 
Rowlinson model is obtained when the connected components in the continuum random- 
cluster model are assigned independent types, plus or minus with probability 1/2 each. 
To see why this is true, note that for any x € Qa there are exactly 2 k ^ elements of 
Q\ x Q\ which do not contradict the hard sphere condition of the Widom-Rowlinson 
model and which map into x when we disregard the types of the points. 

Besides the random-cluster representation, we can also take advantage of stochastic 
monotonicity properties. Let us define a partial order X on f2 x 0, by setting 

(x,y) r< (x',y') if xCx'andyDy', (75) 

so that in other words a configuration increases with respect to ^ if plus-points are added 



and minus-points are deleted. A straightforward extension of Theorem 10.4 below then 



implies that the Gibbs distributions for the Widom-Rowlinson model have positive 
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correlations relative to this order. The methods of Sections LI and L3 can therefore 
be adapted to show that the Widom-Rowlinson model on R d at intensity A admits two 
particular phases fit and [j,7, where n't is obtained as a weak limit of Gibbs measures 
on compact sets (tending to R d ) with boundary condition consisting of a dense crowd 
of plus-points, and /j,^ is obtained similarly. We also have the sandwiching relation 

V (76) 

for any Gibbs measure for the intensity A Widom-Rowlinson model on R d , so that 
uniqueness of Gibbs measures is equivalent to having fj,^ = n\. 

The Gibbs measure for the Widom-Rowlinson model on a box A with "plus" (or 
"minus") boundary condition corresponds to the wired continuum random-cluster model 
(/>} A onA where all connected components within distance 1/2 from the boundary count 



as a single component. Arguing as in Sections and |6.7| we find that uniqueness of 
Gibbs measures for the Widom-Rowlinson model is equivalent to not having any infinite 
connected components in the continuum random-cluster model. Let A c be as in Theorem 



10.1. Theorem 10.2 follows if we can show that the continuum random-cluster model 
4>\ A with sufficiently large intensity A stochastically dominates tta^a for some Ai > A c , 
whereas 4>\ A <x> n\ 2 ,A for some A2 < A c when A is sufficiently small. 

To this end we need a point process analogue of Theorem |4.8| , which is based on 
the concept of Papangelou (conditional) intensities for point processes. Suppose \i is a 
probability measure on $7 a which is absolutely continuous with density /(x) relative to 
the unit intensity Poisson process 7Ti \. For x S A and a point configuration x £ £l\ 
not containing x, the Papangelou intensity of [i at x given x is, if it exists, 

» (lW = SS^L . (77, 

Heuristically, A(x|x)dx can be interpreted as the probability of finding a point inside an 
infinitesimal region dx around x, given that the point configuration outside this region 
is x. Alternatively, A(- 1 •) can be characterized as the Radon-Nikodym density of the 
measure / fj,(dx) Exex ^,x\{i}) on A x Oa, the so-called reduced Campbell measure of 
(j,, relative to the Lebesgue measure times n |98[] . It is easily checked that the Poisson 
process 7Ta,a has Papangelou intensity A(x|x) = A. 



The following point process analogue of Theorem 4.8 was proved by Preston [19£] 
under an additional technical assumption, using a coupling of so called spatial birth- 
and-death processes similar to the coupling used in the proof of Theorem 14.81. Later, 



the full result was proved in Georgii and Kiineth |98J by a discretization argument. 

Theorem 10.4 Suppose /i and \x are probability measures on with Papangelou in- 
tensities A(- 1 •) and A(- | •) satisfying 

A(x|x) < A(x|x) 

whenever x £ A and x, x G Q\ are such that x C x. Then fi -<x> A; * n that there exists 
a coupling (X, X) of [i and jl such that X C X a.s. 



Plugging (|74f ) into (|77|) we find that the continuum random-cluster measure </>a,a has 
Papangelou intensity 

A(x|x) = \2 1 - K ^ , (78) 
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where k(x, x) is the number of connected components of (J gx .£?(?/, 1/2) intersecting 
B(x,l/2). It is a simple geometric fact that there exists a constant n max — f^max 

(d) < oo 

such that k(x,x.) < K max for all x and x; for d = 2 we may take K maa; = 5. It follows 
that 

X2 1 ~ Kmax < A(x|x) < 2A (79) 



for all x and x. Hence, applying Theorems |10.4| and 10.1 we find that <p\ A <v ^2A,Ai 



so that for A < A c /2 we obtain the absence of unbounded connected components of 
Uxgx B{ x i 1/2) in the limit A f R rf of continuum random-cluster measures. On the other 
hand, taking A > \ c 2 Kmax ~ 1 yields the presence of unbounded connected components 
in the same limit. Theorem 10. 2\ follows immediately. 



It is important to note that this approach does not allow us to show that nonunique- 
ness of Gibbs measures depends monotonically on A. The reason is similar to what we 



saw for the lattice Widom-Rowlinson model in Section 3.7: the right hand side of ( 



fails to be increasing in x. It thus remains an open problem whether one can actually 



take A' = A'' in Theorem 110.2 



There are several interesting generalizations of the Widom-Rowlinson model. Let 
us mention one of them, in which neighboring pairs of particles of the opposite type 
are not forbidden, but merely discouraged. Let h : [0, oo) — > [0, oo] be an "interspecies 
repulsion function" which is decreasing and has bounded support. For A C JH d compact 
and A > 0, the associated Gibbs distribution /J>h,\,A on x S1a is given by its density 

/(x, y) = ^ — exp ( - K\ x -y\))- 

relative to tt\,\ x 7T\ a- Infinite volume Gibbs measures onfixfl are then defined in the 



usual way. Lebowitz and Lieb [154] proved nonuniqueness of Gibbs measures for large 



A when h(x) is large enough in a neighborhood of the origin. Georgii and Haggstrom 



[97 1 later established the same behavior without this condition, and for a larger class 
of systems, using the random-cluster approach. This involves a generalization of the 
continuum random-cluster model, which arises by taking the random connection model 
of Section 10.1 with connectivity function g{x) = 1 — e~ h<yX ^ and biasing it with a 



factor 2 k ( z \ where k(z) is the number of connected components of a configuration z of 
points and edges. To establish the phase transition behavior of this "soft-core Widom- 
Rowlinson model" (i.e., uniqueness of Gibbs measures for small A and nonuniqueness 
for large A) one can basically use the same arguments as the ones sketched above for the 
standard Widom-Rowlinson model. However, due to the extra randomness of the edges 
some parts of the argument become more involved. In particular, there is no longer a 
deterministic bound (corresponding to K ma x) on how much the number of connected 
components can decrease when a point is added to the random-cluster configuration; 
thus more work is needed to obtain an analogue of the first inequality in (|7^) . 

To conclude, we note that the Widom-Rowlinson model on JH d has an obvious 
multitype analogue with q > 3 different types of particles. This multitype model still 
admits a random-cluster representation from which the existence of a phase transition 



can be derived [97|. There is, however, no partial ordering like ( |75|) giving rise to 



stochastic monotonicity or an analogue of ([76]) . 
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